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, (54) Title: COMPOSITIONS AND METHODS FOR MAKING MUTATIONS IN CELL LINES AND ANIMALS 
00 (57) Abstract: The present invention is directed generally to reduction or inactivation of gene function or gene expression in cells 



in vitro and in multicellular organisms. The invention encompasses methods for mutating cells using a combination of mutagens, 
particularly wherein at least one mutagen is an insertional mutagen, to achieve homozygous gene mutation or mutation of multiple 
genes required cumulatively to achieve a phenotype to create knock-outs, knock-downs, and other modifications in the same cell. 
The invention is also directed to cells (and libraries thereof) and organisms created by the methods of the invention, including those 
in which at least one of the genes created by insertional mutagenesis is tagged by means of the insertion sequences thereby allowing 
identification of the mutated gene(s)- The invention is also directed to libraries of mutated cells and their uses. The invention is also 
directed to methods of identif5dng mutations with methods of the invention, in cells (and libraries thereof) and organisms, bymeans 
of the insertional tag 
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COMPOSITIONS AND METHODS FOR MAKING MUTATIONS 
IN CELL LINES AND ANIMALS 



CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application claims the benefit of U.S. Provisional Appl. No. 
60/336,497 filed October 22, 2001 and U.S. Application No. 10/196,721 filed July 15, 
2002, the entire disclosures of which are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
Field of the Invention 

5 The present invention is in the fields of molecular biology, cell biology, and 

genetics. The invention is directed generally to mutating genes in cells in vitro and in 
multi-cellular organisms. The invention encompasses methods for mutating genes in 
cells using a combination of mutagens, wherein at least one mutagen is a polynucleotide 
that acts as an insertional mutagen. Such methods are used to achieve mutation of a 

10 single gene to achieve a desired phenotype as well as mutation of multiple genes, 
required cumulatively to achieve a desired phenotype, in a cell or multi-cellular 
organism. The invention is also directed to methods of identifying one or more mutated 
genes, made by the methods of the invention, in cells and in multi-cellular organisms, by 
means of a tagging property provided by the insertional mutagen(s). The insertional 

15 mutagen thus allows identification of one or more genes that are mutated by insertion of 
the insertional mutagen. 
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The invention is also directed to cells and multi-cellxilar organisms created by the 
methods of the invention and uses of the cells and multicellular organisms. The 
invention is also directed to libraries of cells created by the methods of the invention and 
uses of tiie libraries. 



5 Background 

Mutagenesis has been used to identify the ftmction of a large and growing 
number of genes. Mutation of one or more genes in a multi-cellular organism or cell 
allows the artisan to study the mutant organism or cell and compare it to the non- 
mutagenized (which may be wildtype) parent organism or cell. By identifying 

10 phenotypes associated with the mutant organism or cell, the function of the mutated 
gene(s) can be ascertained. Furthermore, mutagenesis provides a means for altering the 
genetic make up of a cell or multi-cellular organism to obtain a desired result. For 
example, it may be desirable to create a physiological disorder in a eukaryotic organism 
by mutating one more genes and then to identify one or more of the relevant genes. 

15 Thus, mutations that have a desired use (e,g,, for commercial production of proteins, 
foodstuffs, or pharmaceuticals, or for production of transgenic animals as models of 
certain diseases) can be identified and selected. The possibilities for use of this 
technology, whether in vitro^ ex vivo, or in vivo, are well known in the art. 

Identification of novel genes and characterization of their function using 
20 mutagenesis has also been shown to be productive in identifying new drugs and drug 
targets. Creating in vitro cellular models that exhibit phenotypes that are clinically 
relevant provides a valuable substrate for target identification and screening for 
compounds that modulate not only the phenotype but also the target(s) that controls the 
phenot3^e. Modulation of such a target can provide information that validates the target 
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as important for therapeutic intervention in a clinical disorder when such modulation of 
the target serves to modiilate a clinically relevant phenotype. 

Animal models exhibiting clinically relevant phenotypes are also valuable for 
drug discovery and development and for drug target identification. For example, 

5 mutation of somatic or germ cells facilitates the production of genetically modified 
offspring or cloned animals having a phenotype of interest. Such animals have a number 
of uses, for example as models of physiological disorders {e.g., of human genetic 
diseases) that are useful for screening the efficacy of candidate therapeutic compounds 
or compositions for treating or preventing such physiological disorders. Furthermore, 

10 identifying the gene(s) responsible for the phenotype provides potential drug targets for 
modulating the phenotype and, when the phenotype is clmically relevant, for therapeutic 
intervention. In addition, the manipulation of the genetic makeup of organisms and the 
identification of new genes have important uses in agriculture, for example in the 
development of new strains of animals and plants having higher nutritional value or 

15 increased resistance to environmental stresses (such as heat, drought, or pests) relative to 
their wildtype or non-mutant counterparts. 

Since most eukaryotic cells are diploid, two copies of most genes are present in 
each cell. As a consequence, homozygous mutation is usually required to produce a 
desired phenotype, since mutating one copy of a gene may not produce a sufficient 
20 change in the level of gene expression or activity of the gene product from that in the 
non-mutated or wildtype cell or multicellular organism, and smce the remaining 
wildtype copy would still be expressed at sufficient levels to produce a functional gene 
product. Thus, to create a desired change in the level of gene expression and/or function 
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in a cell or multicellular organism, at least two mutations, one in each copy of the gene, 
are required in the same cell. 

In other instances, mirtation in multiple genes may be required to produce a 
desired phenotype. In some instances, a mutation in one copy of a gene may affect the 
5 expression levels of the gene but not the activity of the gene product to a desired extent, 
so that the desired physiological effects on the cell or multi-cellular organism is not 
achieved. However, a mutation in a second gene, even in only one copy of that second 
gene, can reduce gene expression levels of the second gene to produce a cumulative 
phenotypic effect in combination with the first mutation, if the expression levels of both 
10 genes are sufficiently low. This effect can alter the function of a cell or multi-cellular 
organism. An example of this phenomenon is the synergy between blood clotting 
Factors VHI and IX. A mutation in either gene alone could result in levels that are 
severely reduced but with no effect on the clotting function. Severe reductions in the 
level of expression of both genes, however, can have a major impact. This principle can 
1 5 be extended to other instances where mutations in multiple (two, three, four, or more, for 
example) genes are required cumulatively to produce an effect on activity of a gene 
product or on another phenotype in a cell or multi-cellular organism. It should be noted 
that, in this instance, such genes may all be expressed in the same cell type and 
therefore, all of the required mutations occur in the same cell. However, the genes may 
20 normally be expressed in different cell types (for example, secreting the different gene 
products from the different cells). In this case, the gene products are expressed in 
different cells but still have a biochemical relationship such that one or more mutations 
in each gene is required to produce the desired phenotype. 
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Unfortunately, few methods exist for creating cultured cells that contain multiple 
gene mutations that produce, cumulatively, a desired phenotype. Such methods often 
are time-consiiming and prone to error. In addition, it is often very difficult or 
impossible to identify the genes that have been mutated using such methods. 

5 Further, methods for making homo2ygous mutations in cultured cells, where the 

mutated genes are not known in advance of mutation, are not known to ctirrently exist. 
Still fijrther, without a way to identify a homozygous mutation, the artisan cannot 
associate the phenotype with a given mutation. Currently, to associate a desired 
phenotype with a homozygous mutation in a cultured cell, the location, structure and/or 

10 fiinction of the gene must be known to the artisan in advance. Hence, the methods of 
mutation known in the art are not suitable for homozygously mutating a cell to achieve a 
desired phenotype and identifying the gene(s) responsible for the phenotype. Nor are 
there methods suitable for making cells with multiple mutations that cumulatively 
produce a desired phenotype and identifying the genes responsible for liie phenotype. 

15 Several approaches for introducing mutations into eukaryotic genes are currently 

in use. Each has significant limitations. 

One approach is homologous recombination to mutate the level of gene 
expression or activity of a gene product in a cell. 1: Montgomery et al.. Cell. 1991 Feb 
22;64(4):693-702; 2: te Riele et al. Nature. 1990 Dec 13;348(6302):649-51; 3: Mansour 
20 et. al., Proc Natl Acad Sci USA. 1990 Oct;87(19):7688-92; 4: KoUer et al., Proc Natl 
AcadSciUS A. 1989Nov;86(22):8927-31; 5: Capecchi MR. Science. 1989 Jun 
16;244(4910):1288-92; 6: Zimmer A, Gruss P. Nature. 1989 Mar 9;338(6211):150-3; 7: 
Joyner AL, Skames WC, Rossant J. Nature. 1989 Mar 9;338(6211): 153-6; 8: 
Thompson S, Clarke AR, Pow AM, Hooper ML, Melton DW. Cell. 1989 Jan 
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27;56(2):313-21;9: DoetschmanT, MaedaN, Smithies O. Proc Natl Acad Sci USA. 
1988 Nov;85(22):8583-7; 10: Doetschman T, Gregg RG, Maeda N, Hooper ML, 
Melton DW, Thompson S, Smithies O. Nature. 1987 Dec 10-16;330(6148):576-8; 11: 
Thomas KR, Capecchi MR. Cell. 1987 Nov 6;51(3):503-12. 

5 Typically, this approach is taken in embryonic stem cells or embryonic germ 

cells, which are used to make transgenic animals carrying the mutation of interest. An 
important limitation of this approach is that the gene to be mutated must be known in 
advance of mutation, cloned and sequenced to ensure that the mutagenic vector used in 
homologous recombination contains the appropriate targeting sequences. Furthermore, 

10 the process is laborious and results in only one mutant copy of the gene of mterest in the 
cell. Where a phenotype depends on homozygosity for expression, the heterozygous 
cell, therefore, cannot be used to screen for a change in a phenotype of interest unless 
additional work is carried out to eliminate the second copy of the gene by homologous 
recombination. This additional work is tune consuming and expensive, and more 

1 5 importantly can only be done on genes that are known to the artisan in advance. 

Such mutated heterozygous cells can be used to make transgenic animals. 
However, such animals will also be heterozygous and may not express a phenotype 
different from the wildtype or nomnutant animal. Fvirfher breeding of the animals to 
homozygosity is therefore required if one desires to analyze the phenotypic effect of the 
20 mutation. Such breeding is time consuming and expensive. 

Another approach involves chemical mutagenesis of cells and/or organisms (see, 
e.g.. Brown et al. Hum. Mol Genet. 7:1627-1633 (1998); Chen et al. Nature Gen. 
24:3U-?>\1 (2000); Munroe et al. Nature Gen. 2¥:318-321 (2000); Nolan et al. Nature 
Gen. 25:440-443 (2000); the disclosures of all of which are incorporated herein by 
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reference in their entireties for teaching the use of ENU to generate mutations that result 
m detectable phenotypes in cells or animals). This approach relies upon the use of one 
or more chemical mutagens that are able to produce one or more mutations in the 
genome. As is the case for mutation by homologous recombination, however, chemical 

5 mutagenesis also typically results in mutagenesis of only a single copy of a given gene. 
Since in cases where homozygous mutation is required to achieve a desired phenotype, 
both copies of a given gene must be mutated before a desired phenotype can be 
achieved, cells or organisms that undergo a single round of chemical mutagenesis 
typically do not show a desired change in phenotype. Hence, these cells or organisms 

1 0 generally are not useful for achieving for a desired phenotype. 

A ftirther problem is that while chemical mutagenesis results in the mutation of 
one or more genes in a cell, there is no straightforward way to determine the mutated 
gene(s) responsible for the phenotype. This approach also fails to provide a method for 
making multiple mutations that cumulatively provide a desired phenotype that also 
1 5 permits the genes responsible for the phenotype to be easily identified. 

As discussed above (for homologous recombination mutagenesis) mutated 
heterozygous cells prepared by chemical mutagenesis can be used to create transgenic 
animals. However, the animals will also be heterozygous and may not, therefore, 
manifest a change m a desired phenotype from the wildtype. Time-consuming and 
20 costly breeding of the animals to homozygosity is required. Even if a change in the 
desired phenotype is observed in the transgenic animals (even in homozygous transgenic 
animals), it is very difficult, if not impossible, to identify the mutated gene(s) 
responsible for the phenotype. Therefore, a large number of breedings must be carried 
out to clone the mutated gene by standard positional cloning methods. Hence, this 
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process is slow, expensive, difficult to carry out on large numbers of mutant animals, 
and has a Mgh failure rate. Thus, chemical mutagenesis fails to provide homozygous 
mutations in cultured cells (and hence, in transgenic animals produced from such cells) 
and fails to provide a simple way to identify the mutated gene(s) responsible for a 
5 phenotype in cultured cells or in multi-cellular organisms. 

Another approach that has been used to mutate genes involves the use of 
insertional mutagens, such as gene trap vectors, to mutate genes (e.g., Amsterdam et al. 
Genes Dev. 75:2713-2724 (1999); von Melcher et at. Genes Dev. 5:919-927 (1992); 
Gogos et al, J. Virol. 7i:1644-1650 (1997); Voss et al. Dev. Dyn. 272:171-180 (1998); 

10 Zambrowicz et al., Proc. Natl. Acad. Sci. USA P4:3789-3794 (1997); Friedrich et al. 
Genes Dev. 5:1513-1523 (1991); the disclosures of all of which are incorporated herein 
by reference in their entireties for teaching the use of gene traps as a mutagenesis 
technique). These vectors are typically inserted into the genome of a cell by non- 
homologous recombination. Upon insertion, these vectors are designed to disrupt 

15 transcription and/or translation of a gene. Unfortunately, gene trap vectors are 
inefficient mutagens and mutate only one copy of a given gene. As a result, 
homozygous mutations typically cannot be created in cell culture with such mutagens. 
In animals, the mutant animal must be bred to homozygosity of the mutant gene prior to 
phenotypic analysis. Since it is difficult and expensive to breed large numbers of 

20 animals to homozygosity, this approach has only been used on a relatively small number 
of genes to date. 

This approach also fails to provide a method for making multiple mutations that 
cumulatively provide a desired phenotype and where the genes responsible for the 
phenotype can be identified. The probabilities of achieving, in a single cell, insertions in 
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each of the genes required, is low and decreases with the number of genes required to be 
mutated in order to achieve the desired phenotype. Thus, gene traps fail to mutate 
multiple genes and fail to create homozygous mutations in cultured cells. 

Stark et al. (Human Molecular Genetics 8:1925-1938 (1999)), in a review article 
5 on forward genetics in mammalian cells with functional approaches to gene discovery, 
suggested a potential altemative to complementation of mutants by using expression 
libraries in order to clone the missing gene. They indicated that the altemative mvolves 
retrovirus-mediated insertional mutagenesis in conjunction with chemical mutagenesis. 
They indicated that it was impractical to use insertional mutagenesis de novo to 
10 inactivate two alleles of a target gene. However, there was no indication of how this 
might be achieved and the reference failed to disclose a description of the method 
having been carried out in practice. It was suggested to obtain a population of heavily 
mutagenized cells and then insert retroviruses into those cells to inactivate and mark the 
gene. However, there was no report of ttiis suggestion having been carried out, no 
15 guidance regarding how to perform either of the mutagenesis steps to achieve a 
successful homozygous mutation. 

Accordingly, there exists a need in the art to create homozygous gene mutations 
on a genome-wide basis in cell culture and in multicellular organisms without 
knowledge of the gene in advance. There is also a need to provide a way to identify the 
20 gene. There is also a need for a method of mutating multiple genes in a cell, required 
cumulatively to achieve a desired phenotype and to identify one or more of the mutated 
genes. The ability to mutate multiple genes or to mutate both copies of the same gene in 
cultured cells or multi-cellular organisms, coupled witli the ability to identify the mutant 
gene(s) would be a highly useful approach to identify novel genes, correlate genes with 
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functions, and use the mutant genes, their wildtype counterparts, and other variants, for 
example, in drug screening and development, transgenic animal and plant production 
and in the production of desirable gene products. 

BRIEF SUMMARY OF THE INVENTION 

5 The present invention provides, for the first time, a solution to tiie needs 

identified above by providing methods of efficiently mutating multiple genes in the 
same cell and tagging at least one of the mutated genes in cells that contain the mutated 
multiple genes, so that the identity of one or more of the mutated genes can be achieved. 
The present invention also provides methods of making homozygous gene mutations and 

10 tagging the mutated gene. 

In general terms, the present invention is directed to methods for creating 
mutated cells and multicellular organisms using two or more mutagens where at least 
one of the mutagens is a polynucleotide that acts as an insertional mutagen. The 
polynucleotide can be used as a tag to identify the cell containing tiie mutated gene 

15 and/or to identify the mutated gene itself. One aspect of the invention is directed to 
methods for mutating multiple genes, that cumulatively produce a desired phenotype, 
within the same cell, and tagging at least one of the mutated genes. Another aspect of 
the invention is directed to methods of creating one or more homozygous mutations in a 
cell, that sufficiently alter the mutated gene function to generate a desired phenotype, 

20 and that tag at least one of the mutated genes. 

One mutational approach used in accordance with the invention relates to 
methods of mutating cells using physicochemical mutagens. These methods have been 
shown to efficientiy produce mutations in cells, but cannot be used to tag the mutated 
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genes. Since the genes cannot be tagged identification of chemically mutated genes is 
difficult or impossible. 

The other mutational approach used in accordance with the invention relates to 
methods of mutating cells using insertional mutagens. These methods are known to 

5 create heterozygous gene mutations and, in certain cases, to tag the mutated gene. But 
these methods have not been shown to produce homozygous gene mutations within 
individual cells or to efficiently produce mutation of multiple genes in a cell which 
cumulatively produce a desired phenotype. Thus, the present invention utilizes 
physicochemical mutagenesis in conjunction with insertional mutagenesis to achieve 

10 multiple mutations (that cumulatively produce a desired phenotype) within a cell, such 
that at least one of the mutated genes in each cell containing the multiple mutations is 
tagged. Thus, one or more of the mutated genes can be identified by detecting the tag. 
In the case of homozygous mutations, at least one of the mutated copies of the gene is 
tagged so that the mutation that is responsible for the desired phenotype can be 

15 identified. 

According to the invention, physicochemical mutagens are used to mutate one or 
more genes in a cell. Mutations include, but are not limited to, pohit mutations, 
insertions, deletions, inversions, base modifications and translocations. Such mutations 
result in events such as splicing defects that alter exon utilization, firame shifts, 
20 truncation (5' and 30 of transcripts and of proteins (amino terminal and carboxy- 
terminal) and alteration of the sequence or abundance of the protein products of mutated 
genes by changing the identify of individual residues in the primary protein sequence, by 
fusing the coding sequence of more than one gene to create fusion protein and by 
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changing the abundance of otherwise normal proteins through mutagenesis-induced 
changes in the production, stability or translatability of transcripts firom mutated genes. 

Since most eukaryotic genomes are at least diploid (e.g., each gene consists of at 
least two copies), when a gene is mutated in a eukaryotic cell in this fashion, only one 
5 copy is typically mutated so that the other copy remains unmutated. Therefore, a 
homozygous mutation, in which gene expression from both copies of a given gene is 
eliminated, is not typically achieved by physicochemical mutagenesis alone. 

Therefore, in order to circumvent this deficiency, insertional mutagenesis is also 
carried out in addition to the physicochemical mutagenesis. In this procedure, the level 

10 of physicochemical mutagenesis is adjusted to be mutagenic but not to generate the 
phenotype of interest in the absence of the insertional mutagenesis. One or more 
insertional mutagens is mserted into the genome of the host cell or organism in such a 
mamier so as to alter the expression of a functional gene product (e.g., an RNA or 
protein) of one or more cellular genes. The one or more inserted mutagens also have the 

1 5 property of "tagging" the insertionally mutated gene, thereby allowing it to be identified. 

By carrying out both physicochemical mutagenesis and insertional mutagenesis 
on the same cell, a cell is created in which one or more genes have been mutated by the 
physicochemical mutagen and one or more genes have been mutated by the insertional 
mutagen. In one such embodiment of the invention, a cell is created in which one copy 
20 of a given gene has been mutated by the physicochemical mutagen and the other copy of 
the same gene has been mutated by the insertional mutagen, thereby creatmg cells that 
carry homozygous mutations in that gene. Such mutations can produce desired 
phenotypes. The mutant cell can thus be screened for the production of desired 
phenotypes and the tag can be used to identify the gene responsible for the phenotype. 
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Using the two mutational approaches, cells are also provided that contain 
mutations in more than one gene which cumulatively act to produce a desired 
phenotype. At least one of the mutated genes that contribute to causing the phenotype is 
tagged by an insertional mutagen. Takmg a simple case in which two genes are required 
5 to be mutated, the invention includes the following scenarios (1) Gene #1 is mutated by 
physicochemical mutagenesis and Gene #2 is mutated by insertional mutagenesis in one 
cell. (2) Gene #1 is mutated by insertional mutagenesis and Gene #2 is mutated by 
physicochemical in a second cell. Both cells exhibit the desired phenotype, which is 
caused by the mutation of two genes cumulatively. In the first cell, only Gene #2 can be 
10 identified by the tag. In the second cell, only Gene #1 can be identified by the tag. 
These two cells, however, provide complete information about the identity of the genes 
that must be mutated to achieve the desired phenotype. 

The two types of mutational events described herein can be carried out in either 
order, or simultaneovisly. The mutational events can also be repeated, such that a given 

15 cell, population of cells or organism can be subjected to physicochemical mutagenesis 
and/or insertional mutagenesis one or more (e.g., two, three, four, five, six, seven, eight, 
nine, ten, fifteen, twenty, etc.) tunes, in any order or simultaneously. At any point(s) in 
this process the cells or organism can be screened for a desired phenotype, or for 
insertion of an insertional mutagen both having a desired phenotype or containing the 

20 insertion may be isolated and cloned. In a preferred embodiment, physical chemical 
mutagenisis proceeds insertional mutagenisis, cells are selected for insertional 
mutagenisis. The insertional mutagen is a 5' gene trap with signals for site specific 
recombination and transposition. 
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The invention also provides polynucleotides that act as insertional mutagens for 
use in the insertional mutagenesis methods of the invention. Such polynucleotides can 
have any nucleotide sequence and any geometry (e.g., linear, circular, coiled, 
supercoiled, etc.), and can be double-stranded or single-stranded. Preferred insertional 
5 mutagens are described in the Figvires and Examples herein. The only limitation is that 
they be mutagenic and provide the tagging function. 

In a preferred embodiment, the insertional mutagen comprises a splice acceptor 
sequence that is not operably-linked to a promoter sequence. In other preferred 
embodiments, the insertional mutagen comprises one or more of the following elements: 
10 stop codons in all three frames found 3' to the splice acceptor, an internal ribosome entry 
site, a selectable marker, and a polyA trap. PolyA traps are described m detail in U.S. 
Application No. 09/276,820 herein mcorporated by reference for the teaching of polyA 
traps. In the above embodiment, the selectable marker preferably is operably-linked to a 
polyadenylation signal. In other preferred embodiments, the msertional mutagen 
15 contains retrovirus sequences that allow the retrovirus replication and infection cycle. In 
other preferred embodiments, the insertional mutagen contains sequences necessary for 
transposition. In a highly preferred embodiment, the insertional mutagen contains a 
splice acceptor that is not operably-linked to a promoter, the spUce acceptor havmg an 
optimal branch point, stop codons m all three frames, an internal ribosome entry site that 
20 includes an exonic splicmg enhancer, a selectable marker with a polyadenylation signal, 
a polyA trap, and wherein these elements are contained in a retrovirus vector or contain 
transposition signal sequences. In a fiirther preferred embodiment, the insertional 
mutagen contains a splice acceptor not operably-linked to a promoter, the splice acceptor 
containing an optimal branch point, stop codons in all three frames, an internal ribosome 
25 entry site, a selectable marker with a polyadenylation signal operably-linked to it, and 
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wherein tiie insertional mutagen is a retrovirus vector or contains transposition signals. 
Furthermore, in any of the embodiments herein and especially in the preferred 
embodiments of above, preferred insertional mutagens also contain recombination sites 
for site specific recombination as described herein. 

5 The invention also encompasses cells created by the methods of the invention, 

which preferably are eukaryotic cells (e.g., plant cells, fungal cells (including yeast 
cells), animal cells (including insect cells, avian cells, worm cells, mammalian cells 
(including human cells, non-human primate or simian cells, rodent (rat, mouse, etc.) 
cells, rabbit cells, bovine cells, ovine cells, porcine cells, canine cells, feline cells), and 

10 the like). These cells can be isolated and cloned, using methods that are well-known in 
the art to those of ordinary skill. Primary or estabUshed cells can be used in the methods 
of the invention to prepare the mutated cells. 

In another aspect, the invention also provides libraries of cells created by the 
present invention. A physicochemical/insertional mutation library is created when both 

15 mutational processes are carried out on more than one cell, preferably 10^-10* cells (for 
cells with large genomes such as mammalian cells) or as needed to ensure mutational 
saturation with the combined mutagenesis. The population of cells can all be the same 
(as in a cell line), or can comprise different subpopulations (as, for example, in cell 
populations prepared J&om tissues). Each clone in such libraries may contain a set of 

20 mutated genes that is distmct firom the set of mutated genes in other clones within the 
library. Alternatively, the same genes may be mutated in different clones but the type of 
mutation of each gene could be different. For example, all three genes. Genes #1, #2, 
and #3, could be mutated in three different clones but the type of mutation (insertional 
vs physicochemical) could differ. (See schematic) Such libraries of cells therefore, are 
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useful to rapidly screen for desired phenotypes (e.g., changes from the wildtype or 
nonmutant phenotype) that result from various single mutations or various combinations 
of mutations. 

A library can also encompass a population (two or more, preferably 10^-10^ of 
5 cells that has been subjected to either insertional mutagenesis or physicochemical 
mutagenesis but not both. These libraries serve as a population of cells that form a 
substrate for fiirther mutation by the second mutagenic process of the mvention. The 
efficiency of physicochemical mutagenesis permits the creation of smaller libraries that 
contain mutation in all genes than are obtained using the less efficient insertional 
10 mutagenesis. 

Accordingly, the invention encompasses a method for making a library of 
mutagenized cells by insertionally mutagenizing at least one physicochemically 
mutagenized cell from among two or more physicochemically mutagenized cells. The 
invention is also directed to a method for making a library of mutagenized cells by 
1 5 physicochemically mutagenizing at least one insertionally mutagenized cell from among 
two or more insertionally mutagenized cells. The invention also encompasses methods 
for making a Ubrary of mutagenized cells by simultaneously mutagenizing two or more 
cells with one or more physicochemical mutagens and one or more insertional mutagens. 
The invention also encompasses methods for making a library of mutagenized cells by 
20 subjecting two or more cells containing a physicochemical mutagen to insertional 
mutagenisis. The invention is also directed to a method for making a library of 
mutagenized cells by administering a physicochemical mutagen to two or more cells 
containing an insertional mutagen. One or more cells can be screened for mutation of a 
specific desired gene or other desired phenotype at any stage of the process, such as after 
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any physicochemical mutagenesis event, after any insertional mutagenesis event, or after 
both or all mutagenesis events. Cells having a desired phenotype can be isolated and 
cloned. Cells can also be screened for the possession of an insertional mutagen at any 
stage of tiie process, such as after any physicochemical mutagenesis event, after any 
5 insertional mutagenesis event, or after both or all mutagenesis events. These cells can 
be subjected to further insertional mutagenesis or physicochemical mutagenesis. These 
cells can be isolated and/or cloned at any stage of the process, such as after selection for 
possession of an insertional mutagen. Such clones can accordingly form the substrate 
for further mutagenesis events. In one embodiment, physicochemical mutagenesis is 
10 performed on a plurality of cells. The cells are then expanded to form clonal 
populations and these populations are subjected to insertional mutagenesis so that a 
heterogeneous popiilation of insertionally mutagenized cells is formed. These cells may 
or may not be expanded but can be screened for mutation of a specific desired gene or 
for a desired phenotype. In another exemplary embodiment, a plurality of cells is 
15 physicochemical mutated, the cells expanded, and then insertionally mutagenized. This 
first insertion may or may not be the insertion that causes the phenotype. An insertion 
that causes the phenotype may be produced from fiirther insertions of tlie original 
insertional mutagen. For example, after the original insertional mutagenesis event, the 
cells could be expanded and, after expansion, a component could be introduced into the 
20 cell or induced in the cell that will cause further insertional mutagenesis, such as a 
transposase. In a related exemplary embodiment, cells can be both physicochemically 
mutagenized and insertionally mutagenized (simultaneously) expanded, and a factor to 
cause insertional mutagens can be introduced either into the cells or can be induced in 
the cells. In a related exemplary embodiment, cells can be insertionally mutagenized 
25 with the introduction of a first insertional mutagen, ejcpanded, and then a factor causmg 
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further insertion from the endogenous insertional mutagen can be activated in the cells 
which can then be physicochemically mutated. As mentioned, cells may or may not be 
expanded following each treatment. 

At any stage after a phenotype is produced by mutation, the presence of site- 
5 specific recombination sequences on the insertional mutagen can be used to ascertain 
whether the insertion caused the phenotype. If the excision of the insertional mutagen 
reverts the phenotype, this indicates that mutation causing the phenotype was caused by 
an insertion. 

The invention also provides methods of using the cells and libraries to screen for 
10 phenotypes that are created by the mutagenesis methods of the invention and to identify 
one or more mutations responsible for the phenotype. 

The methods of the invention provide a way to establish the fixnction of a gene. 
Wifli the methods it is possible to determine the function of any specific desired gene. 
Cells can be mutated and screened for a mutation in a specific desired gene with any 
15 assay that can be used to specifically detect a mutation in that gene. The effect of the 
mutation on the cell or animal can then be ascertained. In this case, the tagged 
insertional mutagen is useful for identification of cells containing mutation of the gene 
of interest. 

Altematively, mutated cells can be screened at random for a desired phenotype, 
20 or for production of a desired phenotype in a multi-cellular organism made from the cell, 
and the phenotype can be then correlated with mutation in one or more genes by means 
of the tag. Altematively, a cell that is mutated can be selected on the basis of the 
phenotype it has or confers on an organism made firom the cell, and that phenotype can 



-18- 



wo 03/035834 PCT/US02/33714 
then be correlated with mutation in one or more genes by means of the tag. Thus, any 
change in phenotype of the cell (or of multicellular organism derived from the cell) from 
tiiat of the non-mutated cell (or cellular organism) can be ascribed to the mutated gene. 
Mutated genes that give rise to desired phenotypes can be selected, identified, and 
5 characterized e.g., cloned, sequenced, mapped, etc. According to this aspect of the 
invention, the ftmction of any gene can be identified and assessed. Thus, a phenotype 
can be correlated with a gene that is known in the art (previously identified, e.g., 
mapped, cloned, sequenced, or otherwise characterized) or with a gene that is not known 
in the art. 

10 Although the methods of the invention identify a mutated gene, the invention 

provides a way to correlate the gene with a function and thus provides a way to ascribe a 
function to the wildtype gene and to use that wildtype gene and gene product. The 
invention, therefore, provides for use of the wildtype gene or other natural variant of the 
gene liiat is identified as described above. This includes, but is not lunited to, allelic 

15 variants, homologs, orthologs, pseudogenes, and the like. The wildtype gene, that has 
been identified by means of tibie mutated version, as well as other variants, can be 
isolated, for example, from non-mutated cells, usmg standard recombinant DNA or 
molecular biological techniques, such as cDNA library screening or PGR. The wildtype 
gene/protein or other variant can be used, for example, as a therapeutic protein, antibody 

20 target. Naturally-occurring mutants are also useful as therapeutic or diagnostic targets, 
for example, with antibodies or other detectable and/or inhibiting bindmg reagent. 

In a diseased tissue or cell, a naturally mutant gene gives rise to the disease. 
Further mutation by tiie present method can revert the cell or organism to a normal 
phenotype allowing identification of the mutated disease gene. 
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The invention also encompasses use of the mutated cells to produce transgenic 
animals. Transgenic animals can be created from mutant somatic or germ cells, or from 
mutant stem cells {e.g., embryonic or adult stem cells), that have been produced by 
methods of the invention. Donor cells (which may be a somatic cell, an adult stem cell, 

5 a germ cell or an embryonic stem cell from a donor animal) are subjected to insertional 
mutagenesis and physicochemical mutagenesis in vitro so as to produce a mutated donor 
cell with a single homozygous mutation that produces a desired phenotype in the cell or 
organism or a mutated donor cell with mutations in multiple genes (that cumulatively 
will achieve a desired phenotype in the cell or organism). The animal can be made by 

10 transferring the nucleus from the donor cell to a recipient cell (which may be, for 
example, a fertilized oocyte that has been enucleated), and producing a transgenic 
organism from the recipient cell. Alternatively, the mutant stem cell could be implanted 
into a blastocyst or the mutant germ cell used to create a mutant zygote through in vitro 
fertilization or artificial insemination and the resulting mutant zygotes put into a pseudo- 

1 5 pregnant female to produce the transgenic organism. 

Genetically modified animals can be created by transplantation of nuclei from 
cells that have been mutagenized by the techniques of the present invention. Nuclei 
extracted from mutant cells are then implanted into enucleated fertilized eggs, and the 
resultant zygote is implanted into a pseudopregnant female to develop into an animal 
20 carrying the mutations that were generated in the original mutagenized cell. 

Zygotes can also be formed from mutant embryonic or other pluripotent stem 
cell following the blastocyst fusion protocols that have been developed for the creation 
of genetically modified mice. 
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Briefly, the modified stem cells are combined with cells of a diploid or tetraploid 
morula or the modified cells are injected directly into the blastocoel of a developing 
blastocyst. The chimeric zygote that resuhs is implanted into a pseudopregnant female 
to develop into an animal carrying the mutations generated in the stem cell. Genetically 
5 modified germ cells can be created by in vitro retroviral-mediated or other gene delivery 
into spermatogonia! stem cells of both adult and immature animals and can result in 
stable integration of the insertional mutagen in 2-20% of stem cells. After 
transplantation of the transduced stem cells into the testes of infertile recipient animals, 
approximately 4.5% of progeny from these males contain the msertional mutagen, and 
10 this mutagenic vector is transmitted to and functions in subsequent generations. 1: 
Chesne P, Adenot PG, Viglietta C, Baratte M, Boulanger L, Renard JP.Nat Biotechnol. 
2002 Apr;20(4):366-9; 2: Hosaka K, Ohi S, Ando A, Kobayashi M, Sato K. Hum Cell. 
' 2000 Dec;13(4):237-42; 3: Wolf E, Zakhartchenko V, Brem G. Biotechnol. 1998 Oct 
27;65(2-3):99-l 10.; In Hogan B, Beddington R, Costantini F, Lacy E. Manipulating the 
15 Mouse Embryo; a Laboratory Manual Cold Spring Harbor Laboratory Press. 1994; 1: 
Cecconi F, Gruss P.Methods Mol Biol. 2002;185:335-46. Review; 1: Brinster RL. 
Science. 2002 Jun 21;296(5576):2174-6; 2: Nagano M, Brinster CJ, Orwig KE, Ryu 
BY, Avarbock MR, Brinster RL. Proc Natl Acad Sci USA. 2001 Nov 6;98(23): 13090- 
5; 1: Cecconi F, Gruss P. Methods Mol Biol. 2002;185:335-46. 

20 In another embodiment, transgenic organisms can be created from cells chosen 

because they display a desired phenotype in vitro after being subjected to insertional and 
physicochemical mutagenesis according to the methods of the invention. The identity of 
the gene or genes responsible for the phenotype may or may not be known at this stage 
for the preparation of transgenic animals. In another embodiment transgenic organisms 

25 can be created from a cell with a mutation in a specific desired gene. In another 
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embodiment, mutated cells can be selected at random, used to make the transgenic 
organism, and llie transgenic organism can be screened for a desired phenotype. 

The mvention also encompasses methods for making transgenic animals and 
transgenic animals produced by the present methods. Transgenic animals that can be 

5 produced by the methods of the invention include, for example, insects (including 
Drosophila, Spodoptera and Trichoplusa species), birds, worms (including C. elegans\ 
fish (including zebrafish), mammals (including humans, and non-human mammals such 
as simians and other non-human primates, mice, rats, pigs, cows, sheep, dogs, cats, and 
the like). The transgenic animals can be used, for example, as models for human 

10 disease, to study gene function, to screen for phenotypes of interest, for agricultural 
applications, or for drug testing. 

In one aspect of the invention, transgenic animals are produced from cells that 
have been physicochemically mutated in vitro. The cells of the recipient animal may 
then be mutated by insertional mutagenesis. The insertional mutagen can be introduced 

15 exogenously into the animal or can be induced or activated from an endogenous 
incorporated insertional mutagen. In one embodiment, the insertional mutagenesis is 
achieved by crossing the transgenic animal produced from a physicochemically mutated 
cell with another animal containing an insertional mutagen in its germ line, such as a 
transposon, such as those described herein. The animal can be screened for mutation of 

20 a specific desired gene or for expression of another desired phenotype after the 
physicochemically mutated cell has been used to form the animal and/or after the 
insertional mutagenesis. 
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The invention also provides metiiods of producing transgenic plants, and 
transgenic plants produced by such methods. Transgenic plants that can advantageously 
be produced according to such methods include dicotyledenous and monocotyledenous 
plants. 

5 The invention also encompasses the use of the mutant cells for drug screening. 

In this embodiment, mutant cells are exposed to test compounds or compositions which 
may have therapeutic potential, to determine the effect of the compound or composition 
on a desired phenotype induced by one or more mutations m the mutant cells, including 
the level of expression or activity of the mutated gene or protein. Furthermore, the 

10 wildtype genes or other variants that correspond to the mutated gene or genes, also can 
be used to identify drugs that affect a phenotype caused by the gene or genes, including 
the level of expression or activity of the wildtype or variant gene or protein of interest. 

Other preferred embodiments of the present invention will be apparent to one of 
ordinary skill in Ught of what is known in the art, in light of the folloA^ang drawings and 
1 5 description of the invention, and in light of the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS/FIGURES 
Figures lA-lJ: Non-limitmg examples of 5' gene trap insertional mutagens 
vectors useful in the present invention. Each insertional mutagen is illustrated 
schematically in its linear form; however, insertional mutagens of the invention can have 
20 any geometry (linear, circular, coiled, supercoiled, etc.). Horizontal lines and boxes 
indicate polynucleotides, such as DNA or RNA. Stop codons can be present in any 
reading frame, or nested such that they are present in all reading frames. S/A represents 
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a splice acceptor site. S/D represents a splice donor site. pA represents a 
polyadenylation signal. ET represents an epitope tag. 

Figures 2A-2H: Non-limiting examples of 5' gene trap insertional mutagens useful 
in the present invention. Each insertional mutagen is illustrated schematically in its 
5 linear form. Horizontal lines and boxes indicate polynucleotides such as DNA or RNA. 
SM and RG represent selectable marker and reporter gene, respectively. S/A represents 
a splice acceptor site. S/D represents a splice donor site. pA represents a 
polyadenylation signal. IRES represents internal ribosomal entry site. 

Figures 3A-3B: Non-limiting examples of 5' gene trap insertional mutagens useful 
10 in the present invention. Each insertional mutagen is illustrated schematically in its 
linear form. Horizontal lines and boxes indicate polynucleotides such as DNA or RNA. 
SM and RG represent selectable marker and reporter gene, respectively. S/A represents 
a splice acceptor site. pA represents a polyadenylation signal. IRES represents internal 
ribosomal entry site. 

1 5 Figures 4A-4C: Non-limiting examples of 5' gene trap insertional mutagens useful 
in the present invention. Each insertional mutagen is illustrated schematically m its 
linear form. Horizontal lines and boxes indicate polynucleotides such as DNA or RNA. 
p-geo is a fusion of the neomycin resistance gene and P-galactosidase gene. S/A 
represents a spUce acceptor site. pA represents a polyadenylation signal. IRES 

20 represents internal ribosomal entry, site. 5' LTR and 3' LTR represent retroviral long 
terminal repeats. \\f represents retroviral packaging signal. In Figure 4B, the 5' LTR, 3' 
LTR, and y are shown upside down to indicate that the retroviral sequence is in reverse 
orientation relative to the splice acceptor site and p-geo gene. In Figure 4C, the solid 
boxes represent transposon signals. 
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Figures 5A-5E: Non-limiting examples of 5' gene trap insertional mutagens 
containing a 3' gene trap component. Each insertional mutagen is illustrated 
schematically in its linear form. Horizontal lines and boxes indicate polynucleotides 
such as DNA or RNA. Arrows represent promoters. SM and RG represent selectable 

5 marker and reporter gene, respectively, p-geo is a fusion of the neomycin resistance 
gene and |3-galactosidase gene. S/A represents a splice acceptor site. S/D represents a 
splice donor site. pA represents a polyadenylation signal. IRES represents internal 
ribosomal entry site. 5' LTR and 3' LTR represent retroviral long terminal repeats, y 
represents retroviral packaging signal. In Figure 5D, the 5' LTR, 3' LTR, and Y 

10 shown upside down to indicate that the retroviral sequence is in reverse orientation 
relative to the sphce acceptor site and p-geo gene. In Figure 5E, the solid boxes 
represent transposon signals. 

Figures 6A-6C: Proposed mechanism of gene mutation using a 5' gene trap. 
Vector is illustrated schematically in its linear form. Horizontal lines and boxes mdicate 

15 polynucleotides such as DNA or RNA. Arrows represent promoters, p-geo is a fusion 
of the neomycin resistance gene and p-galactosidase gene. S/A represents a spUce 
acceptor site. pA represents a polyadenylation signal. IRES represents internal 
ribosomal entry site. Figure 6A shows the insertional mutagen and the endogenoiis gene 
prior to vector insertion. Figure 6B shows the endogenous gene following insertion of 

20 the insertional mutagen. Figure 6C shows the fusion mRNA produced from the 
endogenous gene. 

Figures 7A-7B: Non-limiting examples of 5' gene trap insertional mutagens 
containing a promoter linked to a selectable marker or reporter gene followed by a 
polyadenylation signal. Each insertional mutagen is illustrated schematically in its 
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linear form. Horizontal lines and boxes indicate polynucleotides, such as DNA or RNA. 
Arrows represent promoters. SM and RG represent selectable marker and reporter gene, 
respectively, p-geo is a fusion of the neomycin resistance gene and |3-galactosidase 
gene. S/A represents a splice acceptor site. pA represents a polyadenylation signal. 

5 IRES represents internal ribosomal entry site. 5' LTR and 3' LTR represent retroviral 
long terminal repeats. \|/ represents retroviral packaging signal. In Figure 7B, the 5' 
LTR, 3' LTR, and \|/ are shown upside down to indicate that the retroviral sequence is in 
reverse orientation relative to the splice acceptor site and p-geo gene. The presence of 
the promoter operably linked to a selectable marker/reporter gene and polyadenylation 

1 0 signal allows selection of integrated insertional mutagens independent of whether or not 
integration has occurred in a transcriptionally active region of the genome. The figure is 
illustrative of several insertional mutagen types containing selectable markers; however, 
the promoter/selectable marker/polyadenylation signal unit can be used on any of the 
insertional mutagens described herein. 

15 Figures 8A-8C: Non-limiting examples of 5' gene trap msertion msertional 
mutagens containing site-specific recombination signals. Each insertional mutagen is 
illustrated schematically in its linear form. Horizontal lines and boxes indicate 
polynucleotides such as DNA or RNA. Arrows represent promoters. Filled triangles 
represent site-specific recombination signals. The site-specific recombination signals 

20 can be in any orientation relative to one another. The figure depicts an orientation that 
promotes excision of the insertional mutagen from the genome. If the signals are placed 
on the insertional mutagen in the opposite direction relative to each other, the insertional 
mutagen would be inverted in the genome following site specific recombination. N-SM 
represents a negative selectable marker gene, p-geo is a fusion of the neomycin 
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resistance gene and p-galactosidase gene. S/A represents a splice acceptor site. pA 
represents a polyadenylation signal. IRES represents internal ribosomal entry site. 5' 
LTR and 3* LTR represent retroviral long terminal repeats, y represents retroviral 
packaging signal. In each example, the 5' LTR, 3' LTR, and \|/ are shown upside down 

5 to indicate that the retroviral sequence is a reverse orientation relative to the splice 
acceptor site and p-geo gene. The position of the site-specific recombination signals hi 
figures 8A and 8C is shown m the viral LTRs such that most of the viral insertional 
mutagen can be excised (see, e.g., Ishida, NucL Acids Res. 27: e35 (1999)). The 
presence of the promoter operably linked to a negative selectable marker and 

10 polyadenylation signal allows selection for cells in which the mtegrated insertional 
mutagen has been excised. The present figure is illustrative of several insertional 
mutagen types containing site-specific recombination signals; however, the site-specific 
recombmation signals can be used on any of the insertional mutagens described herein, 
iacluding the non-viral insertional mutagens. 

15 Figures 9A-9H: Non-limiting examples of 5' gene trap insertional mutagens 
containing site-specific recombination signals (also equivalently referred to herem as 
recombination sites). Each insertional mutagen is illustrated schematically in its linear 
form (although insertional mutagens can exist in any conformation, mcluding linear, 
chcular, coiled, supercoiled, branched, etc.). Horizontal lines and boxes indicate 

20 polynucleotides such as DNA or RNA. Arrows represent promoters. Filled triangles 
represent site-specific recombination signals. The site-specific recombmation signals 
can be in any orientation relative to one another. Recombination sites shown in opposite 
orientation relative to one another (e.g.. Figs. 9D-9F and 9H) produce an inversion 
following recombination, whereas recombination sites shown in the same orientation 

25 relative to one other (e.g. , Figs. 9A-9C and 9G) produce a deletion upon recombination. 
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N-SM represents a negative selectable marker gene, while P-SM represents a positive 
selectable marker gene. Neo represents a neomycin resistance gene. TK represents a 
herpesvirus thymidine kinase (HS V-TK) gene. S/A represents a spUce acceptor site. pA 
represents a polyadenylation signal. IRES represents an internal ribosome entry site. 
5 The presence of the promoter operably linked to a negative selectable marker and 
polyadenylation signal allows selection for cells in which the integrated insertional 
mutagen has been excised. Where the positive selectable marker and/or the negative 
selectable marker lack a promoter on the insertional mutagen, the marker can be 
expressed from an endogenous promoter upon integration of the insertional mutagen into 
10 the genome of the host cell. The present figure is illustrative of several insertional 
mutagen types containing site-specific recombination signals; however, the 
recombination sites can be used on any of the insertional mutagens described herein, 
including the non-viral insertional mutagens. In addition, any of the insertional 
mutagens shown in this figure can optionally contain no (or only one) site-specific 
15 recombination signal(s). The insertional mutagens depicted in Figures 9A-9F can 
optionally lack the S/A, IRES and/or pA signal. Each of the insertional mutagens shown 
optionally can be configured as a viral insertional mutagen and therefore can contain 5' 
and 3' LTRs and packagmg signals. As one of ordinary skill will readily appreciate, 
otiier insertional mutagen elements described herein and/or recognized in the art can be 
20 included in the insertional mutagens in addition to the elements illustrated in the figures. 

Figures lOA-lOG: Non-limiting examples of 5' gene trap insertional mutagens 
containing multiple exons. Each insertional mutagen is illustrated schematically in its 
linear form (although vectors can exist in any conformation, including luiear, circular, 
coiled, supercoiled, branched, etc.). Horizontal lines and boxes mdicate polynucleotides 
25 such as DNA or RNA. Arrows represent promoters. S/A represents a splice acceptor 
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site, and S/D represents a splice donor site. pA represents a polyadenylation signal. 
IRES represents an internal ribosome entry site. SM indicates a positive or negative 
selectable marker. A reporter gene can be substituted for the SM on any of the 
insertional mutagens shown in this figure. In Fig. lOF and lOG, the selectable marker 

5 open reading frame has been separated onto different exons. Upon transcription from an 
endogenous gene, followed by splicing, the open reading frame will be reconstituted to 
produce a fimctional SM. It will be recognized by the ordinarily skilled artisan that each 
of the insertional mutagens depicted in this figure can optionally contain one or more 
site-specific recombination signals (see Figure 9). Optionally, the insertional mutagens 
10 depicted in this figure can lack the S/A, IRES, S/D and/or pA signal. Each of the 
insertional mutagens shown optionally can be configured as a viral insertional mutagen 
and therefore can contain 5' and 3' LTRs and packaging signals. As one of ordinary skill 
will readily appreciate, other elements described herein and/or recognized in the art can 
be included in the insertional mutagens in addition to the elements illustrated in the 

15 figures. 

Figure 11: Method for detecting gene trap insertions that occur in developmentally 
regulated genes. In this example, cells are identified in which a transcriptionally active 
gene became down regulated or silenced in response to specific treatments or 
environmental stimuli to the cells. DNA is illustrated schematically in its linear form. 
20 Horizontal lines and boxes indicate polynucleotides such as DNA or RNA. Arrows 
represent promoters. Filled triangles represent site-specific recombination signals. The 
site-specific recombination signals are depicted in an orientation that promotes excision 
of the positive selectable marker from the genome. P-SM and N-SM represent positive 
selectable marker and negative selectable marker, respectively. S/A represents a splice 
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acceptor site. S/D represents a splice donor site. pA represents a polyadenylation 
signal. IRES represents internal ribosomal entry site. 

Figure 12: Method for detecting gene trap insertions that occur 'm developmentally 
regulated genes. In this example, cells are identified in which a transcriptionally silent 

5 gene (or minimally expressed gene) became tumed on or enhanced in response to 
specific treatments or environmental stimuli to the cells. DNA is illustrated 
schematically in its Unear form. Horizontal lines and boxes indicate polynucleotides 
such as DNA or RNA. Arrows represent promoters. Filled triangles represent site- 
specific recombination signals. The site-specific recombination signals are depicted in 

10 an orientation that promotes excision of the negative selectable marker firom the genome. 
P-SM and N-SM represent positive selectable marker and negative selectable marker, 
respectively. S/A represents a splice acceptor site. S/D represents a splice donor site. 
pA represents a polyadenylation signal. IRES represents internal ribosomal entry site. 

Figure 13: Method for detecting gene trap insertions that occur in developmentally 
15 regulated genes. In this example, cells are identified in which a transcriptionally silent 
gene (or minimally expressed gene) became tumed on or enhanced in response to 
specific treatments or environmental stimuli to the cells. In other examples, it is 
possible to identify cells in which a transcriptionally active gene became down regulated 
or silenced in response to specific treatments or environmental stimuU to the cells. This 
20 is accomplished using the vector shown in this figure in combination with the selection 
scheme shown in figure 11 (i.e. selection for the positive selectable marker, then 
treatment of cells with an agent capable of altering its expression pattem, and selecting 
agamst cells expressing tiie negative selectable marker). DNA is illustirated 
schematically in its linear form. Horizontal lines and boxes indicate polynucleotides 
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such as DNA or RNA. Arrows represent promoters. Filled triangles represent site- 
specific recombination signals. The site-specific recombination signals are depicted in 
an orientation that promotes inversion of the positive and negative selectable markers 
within the genome. P-SM and N-SM represent positive selectable marker and negative 
5 selectable marker, respectively. S/A represents a splice acceptor site. S/D represents a 
splice donor site. pA represents a polyadenylation signal. IRES represents internal 
ribosomal entry site. 

Figure 14: The Figure schematically shows non-limiting examples of cells that result 
from mutagenesis according to the present invention and examples of how genes could 
10 be tagged for detection. "X" denotes a physicochemical event. "T" denotes a tag 
introduced by insertional mutagenesis event. 

(A) Phenotype results from homozygous mutation of single gene. Both copies of 
gene A contain a mutation. The gene can be identified by the tag on one copy. Only 
one cell is required to identify the gene responsible for the phenotype (1). Cell II can be 

15 discarded. 

(B) Phenotype res\alts from heterozygous mutation of two different genes. Cell I 
allows identification, in the same cell, of the two genes responsible for the phenotype. 
Cells n and III are used m combination to identify both genes responsible for the 
phenotype or separately to identify one of the genes responsible for phenotype. Cell IV 

20 can be discarded. 

(C) Phenotype results from heterozygous mutation of Gene A and homozygous 
mutation of Gene B. Cell I allows identification, in the same cell, of the two genes 
responsible for the phenotype. Cells II and III are used in combination to identify both 
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genes responsible for the phenotype or separately to identify one of the genes 
responsible for phenotype. Cell IV can be discarded. 

(D) Phenotype results from homozygous mutation of two different genes. Cell I 
allows identification, in the same cell, of the two genes responsible for the phenotype. 
5 Cells II and HI are used in combination to identify both genes responsible for the 
phenotype or separately to identify one of the genes responsible for phenotype. Cell FV 
can be discarded. 

Figure 15: A schematic representation of a specific example of the combined 
mutagenesis procedure for efficient mutation and tagging of genes involved in creating a 
1 0 desired phenotype. In Step 1 , ENU creates many mutations per cell. In Step 2, gene 
trap mutagen knocks out and tags additional alleles in ENU clones to create phenotype 
and permit identification of the mutated genes. 

Figure 16: pDK02 gene trap knockout vector design and function. Integration of 
1 5 pDK02 retrovirus traps upstream exons of endogenous gene. The following 

abbreviations are used in the figure; S/A: branch site and splice acceptor from the intron 
of an immunoglobulin gene heavy chain variable region obtained by PCR from pCI 
vector (Promega); x: stop codons in all 3 reading frames; IRES: wild type internal 
ribosomal entry site from EMCV; obtained by PCR from pESLVPO (ATCC#67525); 
20 D^: neomycin resistance gene, (pDKO vectors containing otiier selection markers, such 
as those conferring resistance to puromycin, hygromycin, or zeocin, have been created in 
pDK02 derivatives.); bGHpA: bovine growth hormone polyA sequence; PCR from 
pcDNAS.l (Invitrogen); lox: lox71/lox66 sequences, ere recombinase recognition sites; 
TK: thymidine kinase ORF; PGK: promoter; and retrovmis packaging signal. 
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Figure 17: A schematic representation of a'combined mutagenesis screen to identify 
genes needed for FasL induced apoptosis. 

5 Figure 18: A schematic representation of the Fas induced apoptosis pathway. To 
identify gene traps that affect sensitivity to FasL, the doubly mutagenized combined 
mutagenesis library was selected for clones that exhibited resistance to Fas-induced 
apoptosis. Under the FasL selection conditions, there is a low spontaneous background 

1 0 Figure 19: Cre mediated excision of combined mutagenesis gene trap reversion of 
Fas-resistant phenotype. Clones showing FasL resistance caused by the pDK02 gene 
trap were analyzed using RT-PCR, 5 '-RACE and inverse PCR to identify the 
biologically active trapped genes. 



15 
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DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

In the description that follows, a number of terms used in recombinant DNA 
technology are utilized extensively. In order to provide a clear and consistent 
5 understanding of the specification and claims, including the scope to be given such 
terms, the following definitions are provided. 

As used herein, the term "physicochemical mutagenesis" means any method of 
mutating genes that is not insertional mutagenesis (i.e., by insertional mutagen as 
defined below), such as ionizing radiation and/or chemical approaches to induce one or 

10 more mutations in a cell or organism. Physicochemical mutagenesis, therefore, 
encompasses use of chemical mutagens, radiation {e.g., UV, a radiation, p radiation, y 
radiation, x-rays), error prone replication proteins (for example, without limitation, 
mutant DNA polymerases, such as those that lack a proofreading function), restriction 
enzymes (used to create DNA breaks and deletions upon introduction into a host cell), 

15 and DNA repair mutants and inhibitors (used to enhance mutation from spontaneous and 
mduced mutation). Examples include, but are not limited to, mutant cell lines defective 
for the genetic complementation groups of xeroderma pigmentosiim (XP) known as XP- 
G and XP-A, the DNA repair protein Ku, and the DNA-dependent protein kinase DNA- 
PK. An example of a DNA repair inhibitor is O^-BG, an inhibitor of O^-alkylguanine- 

20 alkyltratisferase. Any physicochemical mutagen can be used alone or in combination 
with one or more other physicochemical mutagens. 
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"Insertional mutagenesis", as it relates to the invention, means a process in which 
a polynucleotide is inserted into the genome of a cell in such a way so as to mutate an 
endogenous gene. As used herein the terms "incorporation" or "integration" or 
"insertion" into an endogenous gene are used synonymously. 

5 Insertional mutagenesis can occiir when an insertional mutagen is introduced into 

a cell exogenously and as a result of the exogenous introduction becomes incorporated 
into the genome so as to mutate one or more endogenous genes. The invention, 
however, is also directed to mutagenesis events that occur when an endogenous 
insertional mutagen is caused to insert into locations that are different from the original 

10 location. Such is the case when an endogenous transposable element which is induced to 
further transposition by the action of a transposase. Accordingly, in one embodiment of 
the invention, insertional mutation of an allele or a gene results from transposition of an 
endogenous insertional mutagen. This endogenous insertional mutagen may be 
naturally-occurring in the cell or may have been infroduced into the cellular genome or 

15 the genome of a precursor cell such as a precursor cell in vitro or precursor cell in vivo. 

In one aspect of simixltaneous introduction of different mutagens to a cell, one or 
more of the mutagens is produced endogenovisly. One or more mutagens is present in 
the genome of the cell and can provide for fiirther insertion into the genome at one or 
more new locations. Thus, simultaneous mutagenesis can occur by causing the new 
20 insertions of one or more different mutagens from within the cell and can also occur 
when this endogenous infroduction is concurrent in time with the introduction of an 
exogenous mutagen. 
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The term "gene disruption" as used herein refers to a gene knock-out or knock- 
down in which an insertional mutagen is integrated into an endogenous gene thereby 
resulting expression of a fusion transcript between endogenous exons and sequences in 
the insertional mutagen. 

5 The mutation can result in a change in the expression level of a gene or level of 

activity of a gene product. "Activity" encompasses all functions of a gene product, e.g. 
structural, enzymatic, catalytic, allosteric, and signaling. In one embodiment, mutation 
results in a decrease or elimination of gene expression levels (RNA and/or protein) or a 
decrease or elimination of gene product activity (RNA and/or protein). Most mutations 
10 will decrease the activity of mutated genes. However, both the insertional and 
physicochemical mutagens can also act to increase or to qualitatively change (e.g. 
altered substrate on binding specificity, or regulation of protein activity) the activity of 
the product of the mutated gene. Although mutations will often generate phenotypes 
that maybe difficult to detect, most phenotypically detectable mutations change the level 

15 or activity of mutated genes in ways that are deleterious to the cell or organism. 

The insertional mutagens can also be used to "tag" the mutated gene at least at 
the DNA level, and in one embodiment, at the RNA or protein level, depending on the 
insertional mutagen. The mutagenic sequence may by itself be detectable so as to "tag" 
the insertionally mutated gene or product of the gene. 

20 Thus, as used herein, the term "tag" refers to a structural or functional feature 

(typically, a nucleotide sequence) contained on an insertional mutagen of the invention, 
which permits the location of the insertional mutagen to be determined once it has been 
inserted into a target nucleic acid molecule via recombination (e.g., into the genome of a 
target cell). The tag, accordingly, not only enables location of the insertional mutagen. 

-36- 



wo 03/035834 PCT/US02/33714 

It enables the locus into which the vector has inserted to be identified. Examples of tags 
include, but are not limited to, nucleotide sequences encoding a reporter gene (e.g., p- 
lactamase, p-galactosidase, luciferase, chloramphenicol acetyl transferase, green 
fluorescent protein and its derivatives, yellow fluorescent protein and its derivatives, 

5 blue fluorescent protein and its derivatives, cyan fluorescent protein and its derivatives, 
red fluorescent protein and its derivatives, and the like), and nucleotide sequences 
encoding a selectable marker (which may be a positive selectable marker or a negative 
selectable marker). It should be recognized however that a tag, for purposes of the 
invention need not encode a protein. It simply provides a sequence that allows detection 

10 of either the tag itself or of a nucleotide sequence adjacent to or otherwise linked to the 
tagged sequence. 

The "tag" can also provide the property that the insertional mutagen can be 
detected such that a cell containing the insertional mutagen can be detected (and 
isolated, if desired, or otherwise specifically manipulated). 

15 "5' gene traps" are insertional mutagens that are designed to prevent or reduce 

functional expression of a given endogenous gene upon insertion of the mutagen into the 
gene. 5' gene traps alter gene expression by interrupting the normal splicmg or exon 
structure of the primary transcripts of mutated genes. Splicing interruption is 
accomplished by spUcing of upstream exons of the mutated gene onto splice acceptor 

20 sequences within the insertional mutagen. This change in sphcing often disrupts the 
protein coding of transcripts of the mutated gene. Alternatively, the insertional mutagen 
can insert directly into and disrupt the codmg potential of exons of the mutated gene. 
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"3' gene traps" are insertional mutagens that are designed to activate or otherwise 
enhance transcription of, and optionally translation of, endogenous exons of the gene in 
which the mutagenic vector is inserted. 3' gene traps function by initiating transcription 
of mutated genes at promoter sequences located within the insertional mutagen. 
5 Transcripts initiated in the mutagenic vector continue 3' of the insertion site to mclude 
downstream exons of the mutated gene in chimeric primary transcripts. Exons 
containing vector sequence can splice onto exons of the mutated gene to generate 
chimeric transcripts that encode proteins including all or C-terminal fragments of the 
normal protein product of the mutated gene. 

10 As used herein, "decrease" means that a given gene has been mutated such that 

the level of gene expression or level of activity of a gene product in a cell or organism is 
reduced from that observed in the wildtype or non-mutated cell or organism. This is 
often accompHshed by reducing the amount of mRNA produced from transcription of a 
gene, or by mutating the mRNA or protein produced from the gene such that the 

1 5 expression product is less abimdant or less active. 

As used herein, and unless otherwise indicated, "a" means one or more. 

As used herein, the term "mutated clone" refers to one or more progeny cells 
arising from a mutated parent cell created with one or both mutagenesis methods of the 
invention. 

20 As used herein, "homozygous mutanf refers to a cell in which all copies 

(typically two in most eukaryotic cells, although certain filamentous yeasts, and some 
higher eukaryotic cell lines, may have more than two copies) of a given gene are 
mutated. 
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As used herein, "copies" of genes are also known in the art as "alleles". This 
latter term signifies the naturally-occurring copy of a given gene m a cell. Usually there 
are two copies of a gene in a diploid cell. In some situations (e.g., trisomy 21) there are 
three copies, but fovir or more copies of one or more entire chromosomes, or extra copies 
5 of genes or chromosomal fragments are also encountered naturally. Cells may also be 
experimentally altered to change the number or expression of specific genes. 

The methods and the compositions of the invention may mvolve insertional 
mutagens that contain target nucleotide sequences for homologous recombination. As 
used herein a "target sequence" allows homologous recombination of an insertional 

10 mutagenic nucleotide with cellular DNA at a predetermined site on the cellular DNA, 
the site having homology for sequences in the insertional mutagen, the homologous 
recombination at the predetermined site resulting in the introduction of the msertional 
mutagen into the genome and subsequent mutation. A target sequence may have 
homology where the sequence or sequences withm the gene to be mutated or upstream 

15 or downstream of the gene to be mutated. The use of targeting sequences has been 
disclosed in many U.S. patent appUcations, mcluding U.S. 5,641,670, 6,270,989, and 
5,733,761, all incorporated by reference for teaching a target sequence. 

As used herein, "non-homologous recombination" (which may also be referred to 
equivalently as "illegitimate recombmation") means the joining (exchange or 
20 redistribution) of genetic material through a mechanism that does not involve 
homologous recombination (e.g., recombination directed by sequence homology) and 
that does not involve site-specific recombination (e.g., recombination directed by site- 
specific recombmation signals and a corresponding site-specific recombmase). 
Examples of non-homologous recombmation include integration of exogenous DNA 
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into chromosomes at non-homologous sites, chromosomal translocations and deletions, 
DNA end joining, double strand break repair, bridge-break-fiision, concatemerization of 
transfected polynucleotides, retroviral insertion, and transposition. In most cases, 
non-homologous recombination is thought to occur through the joining of "free DNA 
5 ends." Free ends are DNA molecules liiat contain an end capable of being joined to a 
second DNA end either directly, or following repair or processing. The DNA end may 
consist of a 5' overhang, 3' overhang, or blunt end. 

Retroviral vectors integrate into eukaryotic genomes by a distinct mechanism of 
non-homologous recombination that is catalyzed by the action of the virally encoded 
10 integrase enzyme, and the mechanism of viral integration, replication and infection has 
been well described (reference 0). The mutagenic ability of retroviruses and retroviral 
vectors and their ability to enable the rapid identification of mutated genes through the 
linkage of retroviral tag sequences within tlie transcripts of mutagenized genes are well 
known in the art (reference 2-5). 

15 General reference for mechanisms of retroviral infection, replication, and 

integration: 0: In: Retroviruses. Coffin, JM.; Hughes, SH.; Varmus, HE. Plainview 
(NY): Cold Spring Harbor Laboratory Press; cl 997; Use of wildtype retroviruses as 
mutagens: 1: Varmus HE, Quintrell N, Ortiz S. Cell. 1981 Jul;25(l):23-36; Use of 
retrovirus promoter traps as mutagens and to isolate trapped genes: 2: Friedrich G, 

20 Soriano P. Methods Enzymol. 1993;225:681-701; 3: Gossler A, Joyner AL, Rossant J, 
Skames WC. Science. 1989 Apr 28;244(4903):463-5; 4: Friedrich G, Soriano P. Genes 
Dev. 1991 Sep;5(9):1513-23; 5: von Melchner H, DeGregori JV, Raybum H, Reddy S, 
Friedel C, Ruley HE. Genes Dev. 1992 Jun;6(6):9 19-27; Randomness of retroviral 
insertion: 6: King W, Patel MD, Lobel LI, Goff SP, Nguyen-Huu MC. Science. 1985 
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May 3;228(4699):554-8; 7: Hubbard SC, Walls L, Ruley HE, Muchmore EA. J Biol 
Chem. 1994 Feb 4;269(5):37 17-24. 

Like retroviruses, transposons and transposon vectors can also be used to 
integrate sequences that an act as insertional mutagens. Also like retroviruses, 
5 transposons integrate by enzymatically catalyzed non-homologom recombination in 
which transposase enzymes catalyze the genomic integration and transposition of 
transposon DNA (reference 1, 2, 12, 13). Numerous transposons have been 
characterized that function in insects (reference 13-15), plants (reference 16-20) and 
vertebrates (including mammals, reference 3-12). In particular, the TCI/mariner 
10 derivative transposon. Sleeping Beauty, has been demonstrated to integrate efficiently in 
mammals. Transposons have been shown to function as efficient insertional mutagens 
in numerovis systems (reference 5, 15, 17, 24-26), and to exhibit broad target specificity 
(reference 21-23). Transposase catalyzes SB transposition and integration: 1: Cui Z, 
Geurts AM, Liu G, Kaufman CD, Hackett PB. J Mol Biol. 2002 May 17;318(5):1221- 
15 35; 2: Izsvak Z, Kliare D, Behlke J, Heinemann U, Plasterk RH, Ivies Z. J Biol Chem. 
2002 Jim 24 SB transposon can transpose and act as an insertional mutagen in mammals: 
3: Dupuy AJ, Clark K, Carlson CM, Fritz S, Davidson AE, Markley KM, Finley K, 
Fletcher CF, Ekker SC, Hackett PB, Horn S, Largaespada DA. Proc Natl Acad Sci U S 
A. 2002 Apr 2;99(7):4495-9; 4: Horie K, Kuroiwa A, Ikawa M, Okabe M, Kondoh G, 
20 Matsuda Y, Takeda J. Proc Natl Acad Sci USA. 2001 Jul 31;98(16):9191-6; 5: Dupuy 
AJ, Fritz S, Largaespada DA. Genesis. 2001 Jun;30(2):82-8; 6: Fischer SE, Wienholds 
E, Plasterk RH. Proc Natl Acad Sci USA. 2001 Jun 5;98(12):6759-64; 7: Ivies Z, 
Hackett PB, Plasterk RH, Izsvak Z. Cell. 1997 Nov 14;91(4):501-10. Other transposons 
also function in mammals: 8: Zagoraiou L, Drabek D, Alexaki S, Guy JA, Klinakis AG, 
25 Langeveld A, Skavdis G, Mamalaki C, Grosveld F, Savakis C. Proc Natl Acad Sci U S 
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A. 2001 Sep 25;98(20): 11474-8; 9: Sherman A, Dawson A, Mather C, Gilhooley H, Li 
Y, Mitchell R, Finnegan D, Sang H. Nat Biotechnol. 1998 Nov;16(ll):1050-3; 10: 
Kawakami K, Shima A, Kawakami N. Proc Natl Acad Sci USA. 2000 Oct 
10;97(21):1 1403-8; 11: Fadool JM, Hartl DL, Dowling JE. Proc Natl Acad Sci USA. 
5 1998 Apr 28;95(9):5 182-6; 12: Plasterk RH. Cell. 1993 Sep 10;74(5):781-6. P elements 
developed as insertional mutagen in invertebrates: 13: Kaufinan PD, Rio DC. Nucleic 
Acids Res. 1991 Nov 25;19(22):6336; 14: Rubin GM, Spradling AC. Nucleic Acids 
Res. 1983 Sep 24;11(18):6341-51; 15: Spradling AC, Rubin GM. Science. 1982 Oct 
22;218(4570):341-7. Ac and Ds and other plant transposons transpose, integrate and are 
10 used as insertional mutagens in plants: 16: Grevelding C, Becker D, Kunze R, von 
Menges A, Fantes V, Schell J, Masterson R. Proc Natl Acad Sci USA. 1992 Jul 
l;89(13):6085-9; 17: Walbot V. Curr Opin Plant Biol. 2000 Apr;3(2): 103-7; 18: Pereira 
A, Aarts MG. Methods Mol Biol. 1998;82:329-38; 19: Cooley MB, Goldsbrough AP, 
Still DW, Yoder JI. Mol Gen Genet. 1996 Aug 27;252(1 -2): 184-94; 20: Bhatt AM, 
15 Page T, Lawson BJ, Lister C, Dean C. Plant J. 1996 Jun;9(6):935-45. P element 
transposon can integrate broadly throughout genomes: 21 : Kassis JA, Noll E, VanSickle 
EP, Odenwald WF, PerrimonN. Proc Natl Acad Sci USA. 1992 Mar l;89(5):1919-23; 
22: Berg CA, Spradling AC. Genetics. 1991 Mar;127(3):515-24; 23: Tower J, Karpen 
GH, Craig N, Spradling AC. Genetics. 1993 Feb;133(2):347-59; 24: Cooley L, Berg C, 
20 Kelley R, McKearin D, Spradling A. Prog Nucleic Acid Res Mol Biol. 1989;36:99-109; 
25: Cooley L, Kelley R, Spradling A. Science. 1988 Mar 4;239(4844):1 121-8; 26: 
Spradling AC, Stem DM, Kiss I, Roote J, Laverty T, Rubin GM. Proc Natl Acad Sci U S 
A. 1995 Nov 21 ;92(24): 10824-30. 
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As used herein, the term "phenotype" means any property of a cell or organism. 
A phenotype can simply be a change in expression of an mKNA or protein. Examples of 
phenotypes also include, but are in no way limited to, cellular, biochemical, histological, 
behavioral, or whole organismal properties that can be detected by the artisan. 
5 Phenotypes include, but are not limited to, cellular transformation, cell migration, cell 
morphology, cell activation, resistance or sensitivity to drugs or chemicals, resistance or 
sensitivity to pathogenic protein localization within the cell (e.g. translocation of a 
protein from the cytoplasm to the nucleus), profile of secreted or cell surface proteins, 
(e.g., bacterial or viral) infection, post-translational modifications, protein localization 
10 within the cell (e.g. translocation of a protein from the cytoplasm to the nucleus), profile 
of secreted or cell surface proteins, cell proliferation, signal transduction, metabolic 
defects or enhancements, transcriptional activity, cell or organ transcript profiles (e.g., as 
detected using gene chips), apoptosis resistance or sensitivity, animal behavior, organ 
histology, blood chemistry, biochemical activities, gross morphological properties, life 
15 span, tumor susceptibility, weight, height/length, immune function, organ fimction, any 
disease state, and other properties known m the art. In certain situations and therefore in 
certain embodiments of the invention, the effects of mutation of one or more genes in a 
cell or organism can be determined by observing a change in one or more given 
phenotypes {e.g., in one or more given structural or functional features such as one or 
20 more of the phenotypes indicated above) of the mutated cell or organism compared to 
the same structural or fixnctional feature(s) in a corresponding wild-type or (non- 
mutated) cell or organism (e.g., a cell or organism that in which the gene(s) have not 
been mutated). 
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As used herein, the term "multiploid" means any ploidy greater than haploid. 
Multiploid encompasses diploid, triploid, tetraploid, and aneuploid. 

As used herein "library" means more than one cell. A library may be cells 
subjected to one or both mutagenesis methods, singly or more than one time. Thus a 
5 library includes, but is not limited to, one or more clones of mutated cells or mutated 
cells where each cell has a different set of mutations. Libraries provide a source of cells 
to subject to mutagenesis and a source of cells to screen for desired phenotypes 
following mutagenesis. 

A "known" gene is directed to the level of characterization of a gene. The 
10 invention allows expression of genes that have been characterized, as well as expression 
of genes that have not been characterized. Different levels of characterization are 
possible. These include detailed characterization, such as cloning, DNA, RNA, and/or 
protein sequencing, and relating the regulation and function of the gene to the cloned 
sequence (e.g., recognition of promoter and enhancer sequences, functions of the open 
15 reading frames, introns, and the like). Characterization can be less detailed, such as 
having mapped a gene and related function, or having a partial amino acid or nucleotide 
sequence, or having purified a protein and ascertained a function. Characterization may 
be minimal, as when a nucleotide or amino acid sequence is known or a protein has been 
isolated but the function is unknown. Altematively, a function may be known but the 
20 associated protein or nucleotide sequence is not known or is known but has not been 
correlated to the function. Finally, there may be no characterization m that both the 
existence of the gene and its function are not known. The invention allows expression 
of any gene at any of these or other specific degrees of characterization. 
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The invention could also be practiced with a combination of physicochemical or 
insertional mutation with epigenetic techniques for modifying genome function, such as 
modification of methylation, is RNA antisense RNA, ribozymes, transcription factors 
designed to activate or inactivate multiple transcriptional regulatory sequences, 
5 modifiers of chromosomal proteins, such as histones, that modify the transcriptional 
availability of multiple genes. 

Overview 

The ability to create tagged mutations in multiple genes in multiploid cells and 
multicellular organisms would have utility in many areas, including correlating a 
10 phenotype with the genes responsible for it by gene identification, gene discovery, 
determining gene function, creating phenotypes, discovering drug targets, and making 
human disease models in cells and in multicellular organisms. 

The ability to create tagged homozygous mutations in a cell or multicellular 
organism enables alteration of the genetic make up of a cell and has numerous uses, such 
15 ' as those above as well as correcting genetic defects. The in vitro, ex vivo, and in vivo 
potential uses of this technology are enormous and will be readily apparent to the skilled 
artisan. 

The present invention, therefore, is directed to methods for mutating a single 
gene or multiple genes {e,g,, two or more) in eukaryotic cells and multicellular 
20 organisms. The invention also is directed to insertional mutagens for making the 
mutant cells and organisms, and which also can be used to analyze the mutations that are 
made in the cells and organisms. The invention also is directed to methods in which one 
or more mutated genes is tagged by a tag provided by the insertional mutagen to allow 
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the detection, selection, isolation, and manipulation of a cell with a genome tagged by 
the insertional mutagen and allows the identification and isolation of the mutated 
gene(s). 

The invention provides methods for making multiple mutations (i.e., mutations in 
5 two or more genes that produce a phenotype cumulatively) in cells and organisms and 
tagging at least one of the mutated genes such that it can be rapidly recovered and 
identified. Creation of multiple mutations in a cell where at least one of the mutations is 
tagged is useful in studying gene function. One reason for this is that many phenotypes 
require multiple gene mutations in order to be manifested. Current methods do not 
10 allow for creation of multiple mutations in a cell in a maimer that allows easy 
identification of the mutated genes. The present invention enables multiple mutations to 
be created in the same cell and allows at least one of the mutations to be tagged. 

Libraries that contain the cells mutagenized by a combination of insertional 
mutagenesis and physicochemical mutagenesis can be screened for a phenotype of 
15 interest. In cells that have the phenotype of interest, one or more tagged genes can be 
identified and validated as being responsible for the particular phenotype of interest. 

The mvention also provides methods for making homozygous mutations in 
eukaryotic cells and organisms. The homozygously mutated gene is tagged by an 
insertional mutagen so that it can be identified and, if desired, recovered. Homozygous 
20 mutations are useful for discovering functions associated witibi the mutated gene. 

The methods of the present invention can be used to mutate any eukaryotic cell, 
including, but not limited to, haploid (in the case of multiple gene mutations), diploid, 
triploid, tetraploid, or aneuploid. In one embodiment, the cell is diploid. Cells in which 
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the methods of the present invention can be advantageously used include, but are not 
limited to, primary cells (e.g., cells that have been explanted directly firom a donor 
organism) or secondary cells (e.g., primary cells that have been grown and that have 
divided for some period of time in vitf^o, e.g., for 10-100 generations). Such primary or 
5 secondary cells can be derived from multi-cellular organisms, or single-celled 
organisms. The cells used in accordance with the invention include normal cells, 
terminally differentiated cells, or immortalized cells (including cell lines, which can be 
normal, estabUshed or transformed), and can be differentiated (e.g., somatic cells or 
germ cells) or undifferentiated (e.g., mxiltipotent, pluripotent or totipotent stem cells). 

10 Examples of tissues from which cells can be isolated for use in the present 

invention include, without Umitation, neuronal tissue (including tissue from the central 
and peripheral nervous systems), hematopoietic tissue, lymphatic tissue, immune tissue, 
bone tissue, stromal tissue (including, e.g., bone marrow tissue), mesenchymal tissue, 
mesothelial tissue, connective tissue (including e.g., cartilage, dermal tissue, 

15 subcutaneous tissue, adipose tissue, etc.), endothelial tissue, epitheUal tissue, lung tissue, 
skin tissue, kidney tissue, gastrointestinal tissue (including esophagus, stomach, 
intestine, etc.), brain tissue, heart tissue, pancreatic tissue, muscle tissue, liver tissue, 
gonadal tissue, embryonic tissue including embryonic stem cells and embryonic germ 
cells), zygote tissue, embryonic, and other cells and tissue known in the art. 

20 A variety of cells isolated from the above-referenced tissues, or obtained from 

other sources (e.g., commercial sources or cell banks), can be used in accordance with 
the invention. Non-limiting examples of such cells include somatic cells such as blood 
cells (erythrocytes and leukocytes), endothelial cells, epitheUal cells, neuronal cells 
(from the central or peripheral nervous systems), muscle cells (including myocytes and 
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myoblasts firom skeletal, smooth or cardiac muscle), comiective tissue cells (including 
fibroblasts, adipocytes, chondrocytes, chondroblasts, osteocytes and osteoblasts) and 
other stromal cells {e.g., macrophages, dendritic cells, thymic nurse cells, Schwann cells, 
etc.). Eukaryotic germ cells (spermatocytes and oocytes) can also be used in accordance 
5 with the invention, as can the progenitors, precursors and stem cells that give rise to the 
above-described somatic and genu cells. These cells, tissues and organs can be normal, 
or they can be pathological such as those involved in diseases or physical disorders, 
including but not limited to infectious diseases (caused by bacteria, fungi or yeast, 
viruses (including HIV) or parasites), in genetic or biochemical pathologies {e.g., cystic 
10 fibrosis, hemophilia, Alzheimer's disease, schizophrenia, muscular dystrophy, multiple 
sclerosis, etc.), or in carcinogenesis and other cancer-related processes. 

The eukaryotic cells used in the methods of the present invention can be animal 
cells, plant cells (monocot or dicot plants) or fungal cells, such as yeast. Animal cells 
include those of vertebrate or invertebrate origm. Vertebrate cells are of particular use 

15 in the present invention, especially mammalian cells (including, but not limited to, cells 
obtained or derived fi-om human, simian or other non-human primate, mouse, rat, avian, 
bovine, porcine, ovine, canine, feline and the like), avian cells, fish cells (including 
zebrafish cells), insect cells (including, but not limited to, cells obtained or derived from 
Drosophila species, from Spodoptera species {e.g., SJ9 obtained or derived from S. 

20 frugiperida, or HIGH FIVE™ cells) or from Trichoplusa species {e.g., MGl, derived 
fi-om T. ni)), worm cells {e.g., those obtained or derived from C. elegans), and the like. 
It will be appreciated by the ordinarily skilled artisan, however, that cells from any 
species besides those specifically disclosed herein can be advantageously used in 
accordance v^th the methods of the present invention, using art-known methods in 
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conjiinction with those described herein and without the need for undue 
experimentation. 

Cell lines are also useful in liie present invention. Examples of useful cell lines 
include, but are not Imiited to, HT1080 cells (ATCC CCL 121), HeLa cells and 

5 derivatives of HeLa cells (ATCC CCL 2, 2.1 and 2.2), MCF-7 breast cancer cells 
(ATCC BTH 22), K-562 leukemia cells (ATCC CCL 243), KB carcinoma cells (ATCC 
CCL 17), 2780AD ovarian carcinoma cells (see Van der Blick, A. M. et al.. Cancer Res. 
^5:5927-5932 (1988), Raji cells (ATCC CCL 86), Jurkat cells (ATCC TIB 152), 
Namalwa cells (ATCC CRL 1432), HL-60 cells (ATCC CCL 240), Daudi cells (ATCC 

10 CCL 213), RPML 8226 cells (ATCC CCL 155), U-937 cells (ATCC CRL 1593), Bowes 
Melanoma cells (ATCC CRL 9607), WI-38VA13 subline 2R4 cells (ATCC CLL 75.1), 
and MOLT-4 cells (ATCC CRL 1582), as well as heterohybridoma cells produced by 
fusion of human cells and cells of another species. Secondary human fibroblast strains, 
such as WI-38 (ATCC CCL 75) and MRC-5 (ATCC CCL 171) can also be used. Other 

15 mammaUan cells and cell lines can be used in accordance with the present invention, 
including but not limited to CHO cells, COS cells, VERO cells, 293 cells, PER-C6 cells. 
Ml cells, NS-1 cells, COS-7 cells, MDBK cells, MDCK cells, MRC-5 cells, WI-38 
cells, WEHI cells, SP2/0 cells, BHK cells (includhig BHK-21 cells); these and other 
cells and cell lines are available commercially, for example from the American Type 

20 Culture Collection (P.O. Box 1549, Manassas, Virginia 20108 USA). Many other cell 
lines are known in the art and will be familiar to the ordinarily skilled artisan; such cell 
lines therefore can be used equally well in the methods of the present invention. 
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The present invention can be practiced using plant cells. Methods for culturing 
plant cells, insertionally mutating plant cells, and producing transgenic plants are known 
in the art (see, e.g.. Hall, Robert D., Plant Cell Culture Protocols, Humana Press, New 
Jersey (1999); Gartland and Davey, Agrobacterium Protocols, Humana Press, New 
5 Jersey (1995); each incorporated herein by reference for teaching methods of ciilturing, 
transfecting, mutating, and producing transgenic plants and plant cells). 

In certain embodiments of the invention, cells can be mutated within the 
organism or within the native environment as in tissue explants {e.g., in vivo or in situ). 
Alternatively, tissues or cells isolated from the organism usmg art-known methods and 
10 genes can be mutated according to the present methods. The tissues or cells are either 
maintained in culture {e.g., in vitro), or re-implanted into a tissue or organism {e.g., ex 
vivo). 

Methods of Making Mutant Cells 

The invention involves two types of mutagenesis events - physicochemical and 
15 insertional. In one aspect, multiple genes are mutated which cumulatively produce a 
desired phenotype. The mutations can be homo2ygous, heterozygous, or a combination 
of the two. One or more of these genes is tagged by means of an insertional mutagen. A 
single cell can contain all of the tagged genes or less than all of the tagged genes, for 
example, one tagged gene. Accordingly, m a non-limiting example two mutated genes 
20 in the same cell are reqmred to produce the desired phenotype. Cells are mutated to 
produce the phenotype, screened for the desired phenotype and two cells are selected 
that have the phenotype because both genes are mutated m each cell. One gene is tagged 
in one cell and the other gene is tagged in the second cell. This process allows the 
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identification of both genes that cumulatively produce the desired phenotype even 
though only one gene was tagged in each cell. 

Furthermore, the present invention is useful even if only one gene responsible for 
a phenotype is identified. Such a gene could form a target for modulating the 
5 phenotype. This is useful, for example, in drug discovery and in production of useful 
gene products. Accordingly, one or more of multiple genes required to produce a 
desired phenotype can be identified using the mutagenic methods of the invention. 

The invention also involves using the mutagenic methods to produce a desired 
phenotype that resuhs from the homozygous mutation of a single gene. At least one of 
10 the mutated alleles is tagged. 

The mutation process involves at least one physicochemical mutagenic event, 
and at least one insertional mutagenic event, which can be carried out in any order or 
simultaneously. 

Physicochemical Mutagenesis 

15 One phase of the mutational methods of the present invention uivolves 

physicochemical mutagenesis of a cell or organism by one or more physicochemical 
mutagens. In one embodhnent, physicochemical mutagenesis is the initial event in the 
mutagenic process. In another embodiment, it can be performed simultaneously with an 
insertional mutagenesis event. In another embodiment it follows an initial insertional 

20 mutagenesis event. In any of these embodiments one or more further physiochemical 
and/or insertional mutagenesis events can be carried out in any order and 
simiiltaneously. 
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Physicochemical mutagenesis can be achieved by a variety of mutagenic agents. 
Examples of mutagenic agents known in the art include, but are not limited to, chemical 
mutagens (e.g., DNA-intercalating or DNA-binding chemicals which affect (e.g,, 
increase or decrease) the activity, protein coding potential or expression of a gene 
contained on a DNA molecule to which the chemical has bound), physical mutagens 
(e.g., UV light, ionizing radiation, (gamma, beta and alpha radiation, x-rays), 
biochemical mutagens (e.g., restriction enzymes, DNA repair mutagens, DNA repair 
inhibitors, and error-prone DNA polymerases and repUcation proteins), and the like. 
The mutagenic changes in DNA sequence can occur as a direct consequence of the 
mutagen/DNA interaction. Alternatively DNA repair mechanisms induced in response 
to damage inflicted by the mutagen may participate in implementing mutations. , 

In certain embodiments, chemical mutagenesis is used to induce mutation in one 
or more genes in the target cell or organism. An example of a chemical mutagen 
commonly used to mutate cells and organisms is N-ethyl-N-nitrosourea (ENU). Other 
examples of chemical mutagens useful in the present invention include, but are not 
limited to, ethyhnethanesulphonate (EMS) and ICR191. Many other chemical mutagens 
are known in the art and are useful in the present invention (see, e.g. , E. C. Friedberg, G. 
C. Walker, W. Siede, DNA Repair and Mutagenesis, ASM Press, Washington DC 
(1995); 1: Justice MJ, Zheng B, Woychik RP, Bradley A. Methods. 1997 
Dec;13(4):423-36; 2: Gasparro FP, Felli A, Schmitt IM. Recent Results Cancer Res. 
1997;143:101-27; 3: Vogel EW, Natarajan AT. Mutat Res. 1995 Aug;330(l-2):183- 
208; 4: Anderson P. Methods Cell Biol. 1995;48:31-58; 5: Russell LB. Environ Mol 
Mutagen. 1994;23 Suppl 24:23-9; 6: DeMarini DM, Brockman HE, de Serres FJ, Evans 
HH, Stankowski LF Jr, Hsie AW. Mutat Res. 1989 Jan;220(l): 11-29; 7: Wood RD, 
Sedgwick SG. Mutagenesis. 1986 Nov; 1(6): 3 99-405; 8: Loprieno N, Barale R, Von 
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Halle I^.von Borstel RC. Mutat Res. 1983 Jim; 115(2):21 5-23; 9: Bradley MO, 
Bhuyan B, Francis MC, Langenbach R, Peterson A, Huberman E. Mutat Res. 1981 
Sep;87(2):81-142; 10: Hoffmann GR. Mutat Res. 1980 Jan;75(l):63-129; 1: Flessel 
CP. Adv Exp Med Biol. 1977;91:1 17-28; 12: McCarni J, Ames BN. Proc Natl Acad Sci 
5 USA. 1976 Max;73(3):950-4; 13: Orias E, et al. Methods Cell Biol. 1976;13:247-82; 
14: Kilbey BJ. Methods Cell Biol. 1975;12:209-31 incorporated herein by reference for 
teaching chemical mutagens and their use in inducing gene mutation in various cells and 
organisms). 

Preferred doses of chemical mutagens for inducing mutations in cells and 
10 organisms are described elsewhere herein, are known in the art, or can be readily 
determined by the ordinarily skilled artisan using assays of mutagenesis described herein 
or known in the art. Chemical mutagenesis of cells in vitro can be achieved by treating 
the cells with various doses of the mutagenic agent and/or controlling the time of 
exposure to the agent. By titrating the mutagenic agent exposure and/or dose, it is 
15 possible to carry out the optimal degree of mutagenesis for the intended purpose, thereby 
mutating a desired number of genes m each cell. Examples of useful doses of ENU are 
0.1 - 0.4 mg/ml for 1 - 2 hours. Examples of useful doses of EMS are 0.1-1 mg/ml for 
10-30 hours. While the treatment conditions described herein are known to be useful in 
accordance with the present invention, lower and higher doses and exposure times can 
20 be used to achieve the desired mutation frequency. In addition, optimal doses and 
exposure times will vary from cell type to cell type and can be readily determined using 
the methods of the invention, knowledge that is readily available in the art, and routine 
experimentation. Other chemical mutagens and mutagenic agents are used at doses 
known in the art or determined through routine experimentation. 
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Physicochemical mutagenesis caa be carried out using a single mutagenic agent. 
Alternatively, physicochemical mutagenesis can be carried out using multiple (more 
than one) mutagenic agents. When multiple mutagenic agents are used, mutagenesis can 
be carried out by exposing the cells to the agents at the same time or sequentially. 
5 Physicochemical mutagenesis of a given cell or organism can also be carried out in a 
single exposure or in multiple sequential exposures, in any order. 

If physicochemical mutagenesis of a cell or organism is performed prior to 
another mutagenic event (e.g., insertional mutagenesis or another roimd of 
physicochemical mutagenesis) carried out on the same cell or organism, the mutation 
10 frequency in the physicochemically mutated cell or organism optionally can be 
determined prior to or following the additional mutagenic event (e.g., prior to or 
following insertional mutagenesis or the next roxmd of physicochemical mutagenesis). 
This approach allows the artisan to select the mutagenic agent and conditions that 
achieve the desired mutation jfrequency. 

15 Mutation frequency obtained with a particular mutagen can be assessed using a 

nimiber of methods described herein and known in the art. One such method is to test 
for gene function of a selectable or screenable marker within the cell. Typically, the 
marker used in this approach exists in the cell as a single copy gene; however, multi- 
copy genes could be useful in such methods, particularly to assess high mutation 

20 frequencies. The marker gene can be a cellular gene, such as HPRT, or could be an 
exogenously introduced gene, such as HSV-thymidine kinase, green fluorescent protein, 
luciferase, p-galactosidase, a cell surface protein suitable for FACS sorting, and the like. 
Following mutagenesis, the artisan can assay for the absence of, or reduction in, marker 
gene expression or activity. For example, mutation frequency can be assessed m 
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mutagenized cells by selecting against HPRT (endogenous HPRT is X-linked and 
therefore, there is normally only one functional copy in a given cell) or HSV-TK (in 
cells transfected with a single copy of the HSV-TK gene) using selection with 6- 
thioguanine (6-TG) and l,2'-deoxy-2'-fluoro-o-D-arabinofuranosyl-5-iodouracil (FAIU), 
5 respectively. The number of surviving clones divided by the total number of clones 
plated then defines the mutation frequency for single copy genes. Thus, a desired 
mutation frequency can be obtained in any cell or organism by mutagenizing cells under 
several conditions, assaying for marker activity, and determining mutation frequency. 
With respect to the other marker examples described above, and those known in the art, 
10 any suitable assay can be used to test for loss of marker function (or gain of function) 
including enzyme assays, ELISA, and FACS. Once a desired level of mutation in a cell 
population is obtained, one or more cells from that population can be used in the second 
phase of the process, as described below. 

Another method for assessing mutation frequency is to ampUfy (e.g., via PGR) a 
15 genomic locus or cDNA from one cell (or a population of cells) and clone the 
amplification product(s), and sequence the cloned nucleotides. By comparing the 
sequence from the mutagenized cell to the sequence from the wild-type (non- 
mutagenized) cell, it is possible to determine the number of mutations per unit length, 
(e.g., one mutation per 1000 bp). A variety of permutations of this method, and similar 
20 methods, will be readily apparent to a person of skill in the art. Thus, this assay can be 
used to select mutagenesis conditions that give rise to a desired mutation frequency. 
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The present invention involves mutagenizing cells with physicochemical 
mutagens, such as those mutagens that are well-known in the art and that therefore will 
be famiUar to the ordinarily skilled artisan, such that one or more genes are mutagenized 
in each cell. 

5 Any number of genes can be mutagenized in each cell. To select a mutation 

frequency and library coverage, it is useful to consider that there is a relationship 
between the total number of genes contained in the genome of the cell, the nmnber of 
mutated genes produced in each cell, the number of mutated cells generated, and the 
complexity of the library. In general, as the number of genes in a cell increases (e.g. 

10 yeast versus mammalian cells), the number of mutations per cell or the number of 
mutated cells (or both) must increase m order to produce the same library coverage. 

As a general gxiideline, creation of a larger number of mutations per cell reduces 
the number of cells required to produce a library of cells in which every gene in the 
genome has been mutated, on average, at least one time. Likewise, as the mutation 

15 frequency increases, the size of the library can decrease without affecting its coverage. 
For example, if one copy of a given gene is mutated at a frequency of approximately 1 
cell out of 100 sxirviving mutagenized cells, then 100 mutant cell clones will be 
necessary to produce a library in which one copy of each gene has been mutated, on 
average, one time. However, if one copy of a gene is mutated at a frequency of 1 cell in 

20 30 cells, then only 30 mutant clones are necessary to produce the same IX library. A 
library of this size can be referred as a IX library to indicate that one copy of each gene 
has been mutated one time, on average. 
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In preferred embodiments, at least 50-10,000 (one mutated copy of 0.05-10% of 
the total number of alleles in the diploid genome) genes are mutated in each cell. In 
highly preferred embodiments, at least 250-5,000(one mutated copy of 0.25-5% of the 
total number of alleles in the diploid genome) genes are mutated in each cell. In other 
5 embodiments, it is useful to mutate 10-100 genes in each cell. In still other 
embodiments, it is useful to mutate 10,000-50,000 genes, or more. As the number of 
mutations per cell increases it becomes increasingly likely that both alleles of some 
genes will be mutated as well as becoming increasingly likely that multiple genes that 
are needed for a particular biological function will be mutated in a single cell. Thus 
10 mutagenizing a cell can sensitize this cell or libraries of similarly mutagenized cells to 
the induction of phenotypes as a consequence of further mutation (e.g. insertional 
mutagensis) on this sensitized genetic background. 

It should be noted that a IX library is only an estimate of library complexity. 
Statistically, assuming a random mutation frequency, a IX library will contain about 

15 66% of the genes with a single mutation. Some genes may be mutated twice (where the 
genome in the cell is diploid) and others will not be mutated. To increase the probability 
that most or all genes in a given library are mutated at least once, a library of increased 
complexity can be used (e.g., a 5X or lOX library can be used). In other embodiments 
that do not require mutation of every gene in a genome, libraries of less complexity can 

20 be created (e.g. , a 0. IX (or less) library). 

It is also possible to estimate library size and coverage by assessing the mutation 
frequency of individual genes in the library. For example, if HPRT and HS V-TK are 
both mutated at a frequency of 1 in 500 cells under a specific mutation condition, then a 
IX library could be estimated to contain 500 cells. Such libraries of increased or 
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decreased complexity can be created by manipulating the conditions of physicochemical 
mutagenesis to which the cells are exposed, e.g., by increasing the dose and/or exposure 
time of the physicochemical mutagen(s) to make libraries of increased complexity, or by 
decreasing the dose and/or exposure time of the physicochemical mutagen(s) for 
5 preparation of libraries of decreased complexity. 

If physicochemical mutagenesis is carried out prior to insertional mutagenesis, 
the physicochemically mutated cells or organism optionally can be screened at this time 
for a desired phenotype. This screening can be done whether or not the mutation 
frequency in the physicochemically mutated cells is determined. Mutant cells tliat 

10 display the desired phenotype prior to insertional mutagenesis can then be discarded; it 
is desirable to remove cells displaying the desired phenotype at this stage since they 
would interfere with subsequent screening by displaying the desired phenotype 
independent of insertional mutagenesis. As disclosed herein, insertional mutagenesis 
provides a way to tag a gene that is involved in the induction of a desired phenotype. If 

15 the phenotype is already produced by the physicochemical mutagenesis of a gene, then 
an insertional tag caimot be used to correlate the tagged gene with the phenotype since 
the phenotype is not induced by the insertion event. Thus, in one embodiment of the 
present invention, cells are assayed for desired phenotypes prior to insertional mutation 
to identify and remove cells that have acquired the phenotype without insertional 

20 mutagenesis. 

In another embodiment of the invention where physicochemical mutagenesis is 
carried out prior to insertional mutagenesis, phenotypes are not assessed prior to 
insertional mutation because it is unlikely that the desired phenotype will be produced. 
In this embodiment, it is preferable to mutate cells under conditions that yield a low 
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number of mutations per cell because it is unlikely that the desired phenotype will be 
produced. The reason for this is that when a large number of mutations are made in each 
cell, for example 10,000 mutated genes per cell, the probability of mutating two genes or 
alleles to create a phenotype is relatively high. Conversely, when a small number of 
mutations is made in each cell, for example 100 mutated genes per cell, the probability 
of mutating two or more required genes or alleles in the same cell to create a phenotype 
is low. Furthermore, the number of physicochemically mutated cells selected for 
insertional mutagenesis also impacts the probability that a cell will be selected that has 
the phenotype prior to insertional mutagenesis. For example, if 1,000 mutations are 
created in each cell by physicochemical mutagenesis, but only one cell is selected for 
insertional mutagenesis, the probability that this one cell possesses the desired 
phenotype can be relatively low. On the other hand, if 10 gene mutations are created in 
each cell by physicochemical mutagenesis, but 10^ physicochemically mutated clones 
are selected for insertional mutagenesis, then the probability that one or more cells 
possess the desired phenotype prior to insertional mutagenesis can be relatively high. 
Thus, there is a correlation between the mmiber of physicochemical mutations per cell, 
the number of physicochemically mutated cells selected for insertional mutagenesis, and 
the probability of selecting one or more cells displaying a phenotype prior to insertional 
mutagenesis. One method for empirically assessing this relationship for a given 
phenotype is to physicochemically mutate cells under several conditions and then test 
for the phenotype frequency prior to insertional mutagenesis. If, for example, the 
phenotype frequency for a particular physicochemical mutation is found to be 1 mutant 
phenotype observed per 1000 cells, by selecting 50 cells there is only approximately a 
5% probability of selecting a cell displaying the phenotype of interest. Another 
approach for assessing the likelihood of selecting a physicochemically mutated cell 
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displaying the desired phenotype is to detemiine the mutation frequency in a given 
population of mutated cells using one of the assays described above {e.g., using HPRT 
or HSV-TK selection assays). If, for example, the mutation frequency for a single copy 
gene is 1 mutation in 400 clones, then the probability of mutating both alleles of a gene 
5 in a diploid cell is roughly 1 in 160,000 (1/400 x 1/400). By selecting 4000 cells for 
insertional mutagenesis, the probability of cells displaying a phenotype caused by a 
homozygous gene mutation is, therefore, relatively low. 

The foregoing assays for determining mutation frequency are intended as 
non-limitmg examples provided solely to unpart guidelines to the artisan for selecting 
10 cells for insertional mutagenesis. As the ordinarily skilled artisan will be aware, there 
are a number of other assays of mutation frequency that can be employed in conjunction 
with the present invention; the exemplary assays described herein therefore should not . 
be construed as limiting the scope of the invention in any way. 

Insertional Mutagenesis 

15 The other mutational method of the present invention is insertional mutagenesis 

of a cell or organism by one or more insertional mutagens (which can be the same or 
different). In one embodiment, insertional mutagenesis is the mitial mutagenic event 
and precedes a physicochemical mutagenic event. In another embodiment, insertional 
mutagenesis is carried out simultaneously with physicochemical mutagenesis. In 

20 another embodiment, an insertional mutagenesis event follows a physicochemical 
mutagenesis event. In any of these embodiments, the cells can subsequently be 
subjected to one or more additional physicochemical mutagenesis events, one or more 
additional msertional mutagenesis events, or a combination of any number of these 
events in any order or simultaneously. The coupling of insertional mutagenesis with one 
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or more additional mutational events provides the ability to create mutations, in cells or 
organisms, that have previously been difficiilt or impossible to produce or to detect. 

According to the invention, insertional mutagenesis involves the integration of 
one or more polynucleotide sequences into the genome of a cell or organism to mutate 
5 one or more endogenous genes in the cell or organism. Thus, the insertional mutagenic 
polynucleotides of the present invention are designed to mutate one or more endogenous 
genes when the polynucleotides integrate into the genome of the cell. 

Insertional Mutagens 

Accordingly, the insertional mutagens used in the present invention can comprise 
10 any nucleotide sequence capable of altering gene expression levels or activity of a gene 
product upon insertion into DNA that contains the gene. The insertional mutagens can 
be any polynucleotide, including DNA and RNA, or hybrids of DNA and RNA, and can 
be single-stranded or double-stranded, naturally occurring or non-naturally occurring 
(e.g., phosphorothioate, peptide-nucleic acids, etc.). The insertional mutagens can be of 
15 any geometry, including but not limited to linear, circular, coiled, supercoiled, branched, 
hairpin, and the like, and can be any length capable of facilitating mutation, and taggmg 
of an endogenous gene. Typically, the insertional mutagens are at least 5 nucleotides in 
length, at least 10 nucleotides in length, at least 15 nucleotides in length, at least 20 
nucleotides in length, at least 25 nucleotides in length, at least 50 nucleotides in length, 
20 at least 100 nucleotides in length, at least 200 nucleotides in length, at least 250 
nucleotides in length, at least 500 nucleotides in length, at least 1000 nucleotides {e.g., at 
least 1 kb) in length, etc. In some embodiments of the mvention, the insertional 
mutagens can be at least 1 kb in length, at least 2 kb in length, at least 2.5 kb in length, at 
least 5 kb in length, at least 7.5 kb in length, at least 10 kb in length, or larger. 
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Preferably, the insertional mutagens at least 10-15 nucleotides in length. This length 
allows the artisan to use nucleotide primers complementary to the inserted 
polynucleotide to be used to make primer extension products of the insertionally 
mutated gene to detect or characterize (e.g., sequence) the gene. 

In certain embodiments of the invention, the insertional mutagens can comprise 
one or more nucleotide sequences that provide a desired function. Such nucleotides 
sequences include, but not are not limited to, one or more multiple cloning sites, one or 
more transcription termination sites, one or more transcriptional regulatory sequences 
{e.g., one or more promoters, enhancers, or repressors), one or more sequences that 
encode translational signals, one or more open reading frames (ORFs), one or more 
sequences mutating ORFs, one or more stop codons, one or more sequences mutating or 
eliminating stop codons, one or more mRNA destabiUzing elements, one or more RNA 
stabilizing elements, one or more sequences that result in the formation of hairpin loops, 
one or more sequences that disrupt or eliminate hairpin loops, one or more reporter 
genes, one or more splice acceptor sequences, one or more splice donor sequences, one 
or more intemal ribosome entry sites (IRES), one or more transposon sequences, one or 
more site-specific recombination site sequences, one or more restriction enzyme 
sequences, one or more nucleotide sequences encoding a fusion partner protein or 
peptide ie,g,, glutathione-S-transferase (GST), hexahistidine (Hise) or thioredoxin), one 
or more selectable markers or selection modules, one or more bacterial sequences useful 
for propagating the insertional mutagenic polynucleotide molecules in a host cell, one or 
more 3' gene trap cassettes, one or more nucleotide sequences encoding localization 
signals such as nuclear localization signals or secretion signals, one or more nucleotide 
sequences encoding one or more transmembrane regions {e.g., one or more amino acids, 
and typically one or more hydrophobic ammo acids, capable of anchoring a polypeptide 
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into a cellular membrane), one or more origins of replication, one or more protease 
cleavage sites, one or more desired proteins or peptides encoded by a gene or a portion 
of a gene, a 5' gene traps on an insertional mutagen, a 3' gene trap on an insertional 
mutagen, one or more selectable markers, one or more sequences encoding one or more 
5 5' or 3' polynucleotide tails (particularly a poly (A) tail), and the like. As the ordinarily 
skilled artisan will readily understand, the insertional mutagens of the invention can 
comprise one or more of these or other nucleotide sequences, in any order and 
combination, and can comprise more than one of a given nucleotide sequence. 

Preferably, the insertional mutagens comprises a selectable marker, a 5' gene 

10 trap, a splice acceptor with no operatively linked promoter, stop codons in all three 
reading frames, an IRES, a transposon sequence, or a site specific recombination site. 

In one specific embodiment, the insertional mutagen comprises a splice acceptor 
sequence that does not contain an operably-linked promoter 5' to the splice acceptor. 
This polynucleotide can serve as an essential 5' gene trap. In certain other specific 

15 embodiments, the insertional mutagen is found on a retroviral vector. The retrovirus can 
be infectious or non-infectious but in a preferred embodiment the retrovirus vector can 
form infectious retrovirus particles. In another specific embodiment, the insertional 
mutagen contains sequences required for transposition and accordingly forms a 
transposable element. In a fiirther specific embodiment, the insertional mutagen 

20 comprises a polynucleotide having a splice acceptor with no operably-linked promoter, 
the splice acceptor having an optimal branch point, the mutagen also containing three 
stop codons in all three reading firames, an IRES that includes an exonic splicing 
enhancer, the mutagen fiirther comprising a selectable marker with a polyadenylation 
site operably-linked, and a polyA trap component. The polyA trap component can 

25 optionally be constructed as described herein and as generally known ip. the art. PolyA 
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traps are also discussed in U.S. Patent No. 6,410,266, incorporated herein by reference 
for disclosing polyA traps. In anotiier embodiment, the insertional mutagen contains a 
splice acceptor without an operably-linked promoter 5', the splice acceptor containing an 
optimal branch point, the vector further containing three stop codons in all three reading 
5 frames, an IRES that does not contain the enhancer, a selectable marker with an 
operably-linked polyadenylation site, and wherein the vector does not contain the polyA 
trap. 

In certain embodiments, the insertional mutagens comprise one or more 
nucleotide sequences capable of mutating an open reading frame (ORF). For example, 
10 the insertional mutagen can contain a number of nucleotides that is not divisible by 3, 
and which, therefore, would result in a frame-shift upon insertion of the polynucleotide 
into an ORF. 

In another embodiment, the insertional mutagen comprises one or more primer 
recognition sites, thereby facilitating the detection of the mutated gene using primer- 
1 5 based amplification or sequencing methods such as PGR. 

In certain other embodiments of the invention, the insertional mutagens can 
additionally or alternatively comprise one or more stop codons. Stop codons are 
useful for terminating translation of genes, thereby facilitating mutation of a functional 
. protein. The stop codons can be located on one or both strands of a double stranded 
20 insertional mutagen and can be nested to terminate translation in all three reading 
frames. 
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In certain other embodiments of the invention, the insertional mutagens can 
additionally or alternatively comprise one or more mRNA destabilizing elements. Upon 
integration into a gene and incorporation into an mRNA produced by the gene, the RNA 
instability element will decrease the amomt of mRNA from the gene. A number of 
5 mRNA instability elements are known in the art and useful in the present invention (see, 
for example, Shaw et al, Cell 46:659-661 (1986); Ishida et al. Nucleic Acids Research 
27:e35 (1999) each mcorporated herein by reference for teaching RNA mstability 
elements and methods of using such elements). 

In certain other embodiments of the invention, the msertional mutagens can 

10 additionally or alternatively comprise one or more selectable markers. A selectable 
marker is a gene that encodes an expression product that can be selected for or against. 
Examples of selectable markers include but are not limited to: (1) polynucleotide 
segments that encode products which provide resistance against otherwise toxic 
compounds (e.g., antibiotics or other drugs); (2) polynucleotide segments that encode 

1 5 products which are otherwise lacking in the recipient cell {e.g. , tRNA genes, auxotrophic 
markers); (3) polynucleotide segments that encode products which suppress the activity 
of a gene product; (4) polynucleotide segments that encode products which can be 
readily identified (e.g., phenotypic markers such as P-galactosidase, green fluorescent 
protein (GFP), and cell surface proteins); (5) polynucleotide segments that bind products 

20 that are otherwise detrimental to cell survival and/or function; (6) polynucleotide 
segments that otherwise inhibit the activity of any of the polynucleotide segments 
described in (l)-(5) above (e.g., antisense oligonucleotides); (7) polynucleotide 
segments that bind products that modify a substrate (e.g., methylases and restriction 
endonucleases); (8) polynucleotide segments that can be used to isolate or identify a 

25 desired molecule (e.g. specific protem binding sites); (9) polynucleotide segments that 
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encode one or more screenable markers; (10) polynucleotide segments, which when 
absent, directly or indirectly confer resistance or sensitivity of the cell to particular 
compoimds; (11) polynucleotide segments that encode products which are toxic in 
recipient cells; (12) polynucleotide segments that inhibit replication, partition or 

5 heritability of nucleic acid molecules that contain them; and (13) polynucleotide 
segments that encode conditional replication functions, e.g., replication in certain hosts 
or host cell strains or imder certain environmental conditions (e.g., temperature, 
nutritional conditions, etc.). In the present invention, selectable markers allow the 
detection of integration of the insertional mutagens into the host cell genome. In 
10 addition, selectable markers can be positioned on the insertional mutagens to allow 
selection for insertion events that occur in transcriptionally active or silent regions of the 
genome (see Figures 2-6 for non-limiting examples). The selectable marker can be 
expressed from a promoter on the insertional mutagen that is inserted or from a promoter 
located in the polynucleotide to be mutated (see Figure 7 for non-limiting examples). 

15 Selectable markers suitable for use in accordance with this aspect of the invention 
include positive selectable markers and negative selectable markers. 

A positive selectable marker allows cells expressing the selectable marker to 
survive selection, whereas cells not expressing the selectable marker die during 
selection. Examples of positive selectable markers include, but are not limited to, 

20 neomycin resistance gene (neo), puromycin resistance gene (puro), zeomycin resistance 
gene (zeo), hygromycin resistance gene (hyg), histidine D (his D), dihydro-oratase, 
glutamine synthetase (gs), aspartate transcarbamylase, xanthine guanine phosphoribosyl 
transferase (gpt), carbamyl phosphate synthase (cad), multidrug resistance 1 (mdrl), 
thymidine kinase (tk), and hypoxanthine phosphoribosyl transferase (HPRT). Other 

25 suitable positive selectable markers are known in the art and will be familiar to the 
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ordinarily skilled artisan. In accordance with the invention, the selectable marker can be 
expressed firom a promoter on the insertional mutagen that is inserted, or from a 
promoter located in the DNA to be mutated. Accordingly, in the present invention, a 
positive selectable marker can be used, for example, to select for cells that have 
5 integrated the insertional mutagen (regardless of location in the genome) (Figure 7, 
selection for SM) or for cells in which the msertion is into a transcriptionally active gene 
(for examples, see Figures 2-6). 

A negative selectable marker causes cells expressing the selectable marker to die 
during selection, whereas those cells not expressing the selectable marker survive 

10 selection. Examples of negative selectable markers include but are not limited to HPRT, 
thymidine kinase, cholera toxin, pertussis toxin, tetanus toxin, and diphtheria toxin. 
Other negative selectable markers are known in the art and will be familiar to the 
ordinarily skilled artisan. The negative selectable marker used in accordance with the 
invention can be advantageously expressed from a promoter on the insertional mutagen, 

15 or from a promoter located in the DNA to be mutated. In the present invention, a 
negative selectable marker can be used, for example, to select against cells where 
insertion is into a transcriptionally active gene. The presence of a negative selectable 
marker in combination with site-specific recombination signals can also be used to select 
against cells that have failed to delete the sequences between the recombination signals 

20 (see, e.g,. Figures 8C, 9A-9D and 9F for non-limiting examples of insertional mutagens 
useftd in accordance with this embodiment of the invention). 

In certain such embodiments, the insertional mutagen can contain one or more 
positive selectable markers and one or more negative selectable markers. In such 
embodiments in which the insertional mutagen contains both a positive and a negative 
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selectable marker, the markers can be present as separate open reading frames or as a 
single fusion open reading frame (see Figure 9F for a non-limiting example of such a 
polynucleotide). When the selectable markers are present in the polynucleotides as 
separate open reading frames, the positive selectable marker and negative selectable 
5 marker can be expressed as a single polycistronic transcript (see Figure 9G for a non- 
limiting example of such a insertional mutagen) or as separate transcripts (see Figures 
8C and 9A-9C for non-limiting examples of such insertional mutagens). The presence 
of both a positive selectable marker and a negative selectable marker in the same 
insertional mutagen can be used, for example, to select against actively expressed genes 
10 and for developmentally regulated genes, and vice versa, Altematively, the presence of 
both a positive selectable marker and a negative selectable marker can be used to select 
for actively expressed genes that are down-regulated in response to developmental or 
environmental cues. Vectors and methods for trapping and selecting for 
developmentally regulated genes are well-known in the art (see, e.g., Gogos et al , J, 
15 Virol 77:1644-1650 (1997); Wempe et al. Genome Biol 2:research 23.1-23.10 (2001); 
and Medico et al. Nature Biotech iP:579-582 (2001); the disclosures of all of which are 
incorporated herein by reference in their entheties for these vectors and methods). 

In certain other embodiments of the invention, the insertional mutagens can 
additionally or altematively comprise one or more recombination sites, for example one 

20 or more site-specific recombination sites or signals (see Figures 8 and 9). These 
recombination sites are discrete segments on the nucleic acid molecules that are 
recognized and bound by certain recombination proteins during the initial stages of 
integration or recombination between two nucleic acid molecules that each comprise 
such a recombination site. As discussed in detail above, such site-specific recombination 

25 sites or signals are useftil for deleting (or inverting) the inserted insertional mutagen or a 
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portion tiiereof jfrom the DNA into which the msertional mutagen as inserted. This is 
useful, for example, for reverting the cellular phenotype(s) caused by the mserted 
insertional mutagen, which can be useful for confirmmg that a particular change in 
cellular phenotype is caused by a mutation induced by the insertion of the insertional 
5 mutagen. This approach is also useful for removing certain sequences ftom the inserted 
insertional mutagens, such as selectable markers, while leaving other sequences in the 
insertional mutagen, such as sequences that disrupt one or more genes in the DNA in 
which the insertional mutagen has mserted. Any site-specific recombination system can 
be used that is capable of deleting or inverting an inserted msertional mutagen. 
10 Examples of usefvil recombination signals include loxF, FRT, and att which are useful in 
conjunction with Cre, FLP (or FLPe) and PhiC31 recombinases, respectively (see, e.g., 
Hoess and Abremski, in: Nucleic Acids and Molecular Biology, vol. 4. Eds.: Eckstem 
and Lilley, BerUn-Heidelberg: Springer-Verlag; pp. 90-109 (1990); Broach, et al. Cell 
29:227-234 (1982); Ishida et al, Nucl. Acids Res. 27:e35 (1999); O'Gorman et al, 
15 Science 257:1351-1355 (1991); Bergemann et al, Nucl Acids Res. 25.4451-4456 
(1995); Sauer, B., Curr. Opin. Biotech. 5:521-527 (1994); Schwenk et al.. Nucleic Acids 
Res. 2002 Jun 1; 30(ll):2299-306; Schaft et al.; Genesis. 2001 Sep;31(l):6-10; Farley 
et al.; Genesis, 2000 Nov-Dec;28(3-4): 106-10; Rodriguez et al.; Nat Genet. 2000 
Jun;25(2): 139-40; Buchholz et al.; Nat Biotechnol. 1998 Jul;16(7):657-62; Olivares et 
20 al.; Gene. 2001 Oct 31;278(l-2:167-76; Lee L, Sadowski PD, J Biol Chem. 2001 Aug 
17;276(33):3 1092-8; Kolb AF, Anal Biochem. 2001 Marl5;290(2):260-71; Araki et al.. 
Nucleic Acids Res. 1997 Feb 15;25(4):868-72; Albert et al.. Plant J. 1995 Apr;7(4):649- 
59; Santoro et al., Proc Natl Acad Sci USA 2002 Apr 2;99(7):41 85-90; Trinh et al., J 
Immunol Methods, 2000 Oct 20;244(l-2):185-93; Soukharev et al.. Nucleic Acids Res. 
25 1999 Sep 15;27(18):e21 and U.S. Patent Nos. 4,959,317, 5,434,066, 5,888,732, 
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6,080,576 and 6,136,566; each incorporated herein by reference for teaching vectors and 
methods of site specific recombination in mammalian cells). Other examples of suitable 
recombination sites for use in the insertional mutagens of the present invention include 
the attB, att?, attL, and aUR sequences which are recognized by the recombination 
5 protein pint, and by the auxiliary proteins integration host factor (IHF), FIS and 
excisionase (Xis). See Landy, Curr, Opin. Biotech 3:699-707 (1993); see also U.S. 
Patent Nos. 5,888,732, 6,143,557, 6,171,969 and 6,277,608, each of which is 
incorporated by reference herein in its entirety. Additional examples of recombination 
systems include Hin, Gin, Pin, Cin, and VDJ recombination, all of which are well- 
10 known in the art and which will be familiar to the ordinarily skilled artisan. Other site- 
specific recombination systems known in the art would also be recognized as useful by 
the ordinarily skilled artisan, and therefore can be used in accordance with the methods 
and compositions of the present invention. The site specific recombmation signals may 
be wildtype or mutant. Mutant signals can be used, for example, to control the 
15 reversibility of the recombination reaction (reference Araki et al.. Nucleic Acids Res. 
1997 Feb 15;25(4):868-72; Dale et al.. Plant J. 1995 Apr;7(4):649-59; Trinh et al., J 
Immunol Methods, 2000 Oct 20;244(1 -2): 185-93; Soukharev et al. Nucleic Acids Res. 
1999 Sep 15;27(18):e21 incorporated in its entirety). The site-specific recombinases 
used in accordance with this aspect of the invention may be wildtype, mutant, or fusion 
20 proteins. Examples of modified recombinases useful in this aspect of the invention 
include, but are not limited to, cell-permeable CRE and CRE-ER (Jo et al. Nature 
Biotech iP:929-933 (2001); Vallier et al, Proc. Natl Acad. Set USA 95:2467-2472 
(2001); each of which is incorporated by reference herein in its entirety). The 
recombinases used in the invention can be delivered to cells by infection or transfection 
25 of an expression vector encoding the recombinase (Westerman et al, Proc. Natl Acad. 
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Set USA P5:8971-8976 (1996), which is incorporated by reference herein in its entirety), 
transfection of the protein (e.g., via electroporation; Ager et al, Radiat Res, 725:150- 
156 (1991); Chung et al, Radiat Res. 725:107-113 (1991); each of which is 
incorporated by reference herein in its entirety), or through the use of a cell-permeable 
5 recombinase (Jo et al. Nature Biotech, iP:929-933 (2001); Vallier et al, Proc. Natl 
Acad. Sci. USA P5:2467-2472 (2001); each of which is incorporated by reference herein 
in its entirety). Alternatively, the recombinase gene may itself be present on the 
insertional mutagen. The site-specific recombinase gene and/or recombination site firom 
any of these systems can be included on the insertional mutagens of the invention, or can 
10 be introduced into the host cell separately to achieve the desired recombination event. 

Using insertional mutagens containing one or more recombination sites, mutated 
cells or organisms {e.g., cells or organisms containing one or more mutated genes) 
produced by the methods of the invention can optionally be analyzed to confirm that any 
change in phenotype observed in the mutated cell or organism is the result of at least one 

15 insertionally mutated gene. One such method involves the use of site-specific 
recombination to reverse the phenotypic change, typically by inducing a reversion of the 
mutation to the wildtype by deleting or inverting the inserted insertional mutagen. In 
one such embodiment, site-specific recombination signals recognized by specific 
recombinase enzymes (e.g., the att/lnt system from bacteriophage □, the Iox/Ctq system 

20 firom bacteriophage PI, and the frt/FLF system from the Saccharomyces cerevisiae 2|x 
circle plasmid) can be included on the msertional mutagen (see Figures 9A-9H for non- 
limiting examples of such insertional mutagens containing site-specific recombination 
signals). Generally, such recombination sites are positioned on the msertional mutagen 
to allow the entire insertional mutagen, or a portion of the insertional mutagen 

25 responsible for mutating the gene, to be removed from or inverted within the DNA into 
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which the insertional mutagen has inserted by introducing the appropriate recombinase 
enzyme into the cell. Optionally, as discussed below, use of a negative selectable 
marker or a reporter gene can facilitate identification or isolation of cells in which the 
mutagenic portion of the insertional mutagen has been deleted or inverted, and any 
change in cellular phenotype as a result of such deletion or inversion can be assessed so 
as to determine the phenotypic effects of the insertional mutagenesis (e.g,, reversal or 
alteration of the cellular phenotype that is obtained upon insertional mutagenesis 
indicates that at least one insertional mutation is likely responsible for the observed 
change in phenotype firom the wildt3rpe). 

The invention can also be used to identify developmentally regulated genes. In 
one embodiment, the presence of both a positive selectable marker and a negative 
selectable marker in the same insertional mutagen can be used, for example, to select 
against actively expressed genes aad for developmentally regulated genes. 
Alternatively, the presence of both a positive selectable marker and a negative selectable 
marker can be used to select for actively expressed genes that are down-regulated in 
response to developmental or environmental cues. Vectors and methods for trapping 
and selecting for developmentally regulated genes are well-known in the art (see, e.g., 
Gogos et al, J. Virol. 77:1644-1650 (1997); Wempe et al. Genome Biol. 2:research 
23.1-23.10 (2001); and Medico et al. Nature Biotech. iP:579-582 (2001); the 
disclosiH-es of all of which are incorporated herein by reference in their entireties for 
these vectors and methods). Other methods for identifying trapped developmentally 
regulated genes in accordance with the invention involve the use of insertional mutagens 
that function as gene trap vectors such as those depicted in Figures 9B, 9C and 9H. 
These will comprise, for example, at least two site-specific recombination signals (e.g., 
at least two lox sites (e.g., loxP), at least two att sites (e.g., attP, attB, attL and/or attK), 
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at least two FRT sites, or the like), which flank the positive selectable marker (as in 
Figure 9B), the negative selectable marker (as in Figure 9C), or both the positive and 
negative selectable markers (as in Figures 9G and 9H). 

In a first such embodiment, the insertional mutagens depicted in Figures 9B, 9H, 

5 11, and 13, can be integrated using standard methods of introduction of nucleic acid 
molecules into host cells that are well-known in the art and that therefore will be famiUar 
to the ordinarily skilled artisan. Once the insertional mutagen has been integrated, cells 
can be selected based on the positive selectable marker carried by the insertional 
mutagen (and therefore integrated into DNA in the host cell). Surviving clones will 

10 contain the insertional mutagen integrated into a transcriptionally active gene, since it is 
only in such cells that the positive selectable marker will also be expressed. Cells can 
then be treated to delete the positive selectable marker (Fig. 11) or invert the positive 
marker (Fig. 13 legend) and create an operable linkage between the trapped gene and the 
negative selectable marker. In certain such aspects of the invention, this resxilt is 

15 obtained through a site-specific recombination reaction: cells are treated with a site- 
specific recombinase, the identity of which vsdll depend upon the specific recombination 
site used in construction of the insertional mutagen (e.g., Cre recombinase is used with 
lox recombination sites; Int recombinase is used with att recombination sites; FLP 
recombinase is used with frt recombination sites; etc.). Such treatment results in 

20 recombination between opposing recombination sites (see, e.g.. Figure 9B), thereby 
deleting or inverting the positive selectable marker from (and operably linking the 
negative selectable marker and the trapped gene) the genome of the cell. Cells can then 
be cultured under new conditions and/or treated with selection agents, and cells can be 
selected for lack of expression of the negative selectable marker to identify cells in 

25 which transcription of the trapped gene has been reduced or eliminated (e.g., cells that 
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survive the negative selection). This process is illustrated schematically in Figure 1 1 
and described in Fig. 13 legend.. 

In another such embodiment, the insertional mutagens depicted in Figures 9C 
and 9H can be integrated using standard methods of introduction of nucleic acid 

5 molecules into host cells that are well-known in the art and that therefore will be famiUar 
to the ordinarily skilled artisan. Once the insertional mutagens have been integrated, 
cells can be selected based on the negative selectable marker carried by the insertional 
mutagen (and therefore integrated). Cells in which the insertional mutagen has 
integrated into (e,g,, that have trapped) a transcriptionally active gene will die during 

10 selection since it is only in such cells that the negative selectable marker will also be 
expressed, whereas cells in which the insertional mutagen has integrated into a 
transcriptionally silent region of the genome wall survive. Cells can then be treated to 
delete the negative selectable marker and create an operable linkage between the trapped 
gene and the positive selectable marker, for example using site-specific recombination 

15 as outlined above. Such treatment results in recombination between opposing 
recombination sites (see, e.g.. Figure 9C), thereby deleting (Fig. 12) or inverting (Fig. 
13) the negative selectable marker such that the positive selectable marker becomes 
operably linked to the trapped gene. Cells can then be cultured under new conditions or 
treated with agents capable of inducing gene expression, and cells can be selected for 

20 expression of the positive selectable marker to identify cells in which transcription of the 
trapped gene is increased (e.g., cells that survive the positive selection). This process is 
illustrated schematically in Figures 12 and 13. 
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In certain other embodiments of the invention, the insertional mutagens can 
additionally or altematively comprise one or more reporter genes, also known as 
screenable markers. A reporter gene is a gene that encodes an expression product that 
can be detected in the cell. In accordance with the invention, the reporter gene may be 

5 expressed from a promoter on the insertional mutagen, or from a promoter in the DNA 
mto which the insertional mutagen is inserted. Detection of the reporter gene allows the 
artisan to screen for cells that are or are not expressing the reporter gene. In the present 
invention, reporter genes allow insertions to be detected. In addition, reporter genes can 
be positioned on the insertional mutagen to allow screemng for insertion events that 
10 occur in transcriptionally active or silent regions of the genome (see Figures 2-6). 
Reporter genes suitable for use in accordance with the invention can be any gene that 
encodes an expression product for which an assay exists. Examples of such suitable 
reporter genes include, but are not lunited to, enzymes, structural proteins, cell surface 
proteins, and fluorescent proteins. Specific reporter genes known in the art include P- 

15 lactamase, p-galactosidase, luciferase, chloramphenicol acetyl transferase, green 
fluorescent protein and its derivatives, yellow fluorescent protein and its derivatives, 
blue fluorescent protein and its derivatives, cyan fluorescent protein and its derivatives, 
and red fluorescent protein and its derivatives. Many other reporter genes are known in 
the art and would be recognized by the artisan as being useful in the present invention. 

20 Assays for detecting reporter genes include, but are not limited to, enzyme 

activity assays, cell microfluorimetry or fluorescence-activated cell sorting (FACS®), 
magnetic bead cell sorting, ELISA, ELISA Spot, transcriptional reporter assays, and 
cellular phenotypic assays such as proliferation, transformation, morphology, and the 
like. 
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Reporter genes can be used optionally in place of or in addition to the site 
specific recombination sequences to identify developmentally regxilated genes and to 
revert phenotypes. 

In certain other embodiments of the invention, the insertional mutagens can 
5 additionally or alternatively comprise one or more selectable markers and one or more 
reporter genes. The one or more selectable markers and one or more reporter genes can 
be present as a fusion gene or as two discrete open reading frames. Any combination of 
selectable markers and reporter genes can be used, including those detailed above. An 
example of a useful selectable reporter gene fusion is P-geo, a fusion of the neomycin 
10 resistance gene and the p-galactosidase gene. Other fusion genes known in the art, and 
which therefore are familiar to the ordinarily skilled artisan, can also be used ha the 
present invention. 

In certain other embodiments of the uivention, the insertional mutagens can 
additionally or alternatively comprise one or more splice acceptor sequences. Upon 

15 introduction hxto the target DNA, the one or more splice acceptor sequences can become 
paired with one or more splice donor sequences in the target DNA, thereby directing 
splicing from the gene in the DNA to the inserted insertional mutagen. This splicmg, ni 
tum, facilitates mutation of a gene in the target DNA through the creation of a fusion 
mRNA molecule. Any sequence capable of functioning as a splice acceptor site can be 

20 used in accordance with this aspect of the present invention. The splice acceptor site can 
be naturally occurrmg or non-naturally occurring. Splice acceptor sites and methods for 
testing the splicing activity of candidate or putative splice acceptors are known in the art, 
and therefore will be familiar to the ordinarily skilled artisan. In human cells, splice 
acceptor sites have a characteristic sequence represented as: YYYYYYYYYYNYAG, 



-76- 



wo 03/035834 PCT/US02/33714 

wherein Y denotes any pyrimidine and N denotes any nucleotide (see Nucleic Acids 
Research iP:3715-3799 (1991)). 

In other embodiments of the invention, the insertional mutagens can additionally 
or altematively comprise one or more splice donor sequences. Upon introduction into 

5 the target DNA, the one or more splice donor sequences can become paired with one or 
more splice acceptor sequences in the target DNA, thereby directing splicing from the 
polynucleotide to a gene in the target DNA. This spUcing, in tum facilitates mutation of 
the gene through the creation of a fusion mRNA molecule. The splice donor site may 
optionally be paired with a splice acceptor site on the uisertional mutagen. Typically, in 

10 such a configuration, the order of these elements will be the splice acceptor followed by 
the splice donor (see Figures lH-1 J, 2D and 2H for non-limiting examples). The spUce 
donor site may optionally be operably linked to a promoter on the insertional mutagen to 
produce a promoter-splice donor 3' gene trap (see Figures 5B-5E for non-limiting 
examples of such vectors). Any sequence capable of functioning as a spUce donor site 

15 can be used. The splice donor site can be naturally occurring or non-naturally occurring. 
Splice donor sites and methods for testing splicing activity of candidate or putative 
splice donor sites are known in the art, and therefore will be familiar to the ordmarily 
skilled artisan. In human cells, splice acceptor sites have a characteristic sequence 
represented as: (A/C)AG GURAGU, wherein R denotes a purine nucleotide (see Nucleic 

20 Acids Research 7P:3715-3798 (1991)). The insertional mutagen may contain one or 
more exon sequences. These can be naturally occurring or non-naturally made, as by 
recombinant DNA or synthetic methods. The exons may be derived from eukaryotic 
genes. Further, the one or more exons can be in tandem. 
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In other embodiments, the insertional mutagens can additionally or alternatively 
comprise one or more sequences that direct the addition of 5' or 3* polynucleotide tails 
on mRNA molecules transcribed from the DNA into which the insertional mutagen 
integrates. Such sequence can encode any polynucleotide tail, such as poly (A) tails, 
5 poly (G) tails, poly (U) tails, poly (C) tails, poly (I) tails, and the like. In one 
embodiment, the insertional mutagen comprises one or more polyadenylation signals 
that direct the addition of poly (A) tails on mRNA molecules transcribed from the DNA 
into which the insertional mutagen integrates. Polyadenylation signals can be derived 
from naturally occurring or non-naturally occurring sequences. Examples of useful 
10 polyadenylation signals include, but are not limited to, those derived from SV40 genes, 
growth hormone genes (e.g., bovine growth hormone), p-globin genes, actin genes, 
serum albumin genes, and retrovirus genes. Other polyadenylation signals are known in 
the art and will therefore be familiar to the ordinarily skilled artisan. 

In certain other embodiments of the invention, the insertional mutagens can 
15 additionally or alternatively comprise one or more intemal ribosomal entry sites (IRES). 
The IRES allows translation of intemal open reading frames and are useful for 
expressing open reading frames located on the insertional mutagens upon integration 
into transcriptionally active genes. In certain embodiments, the ORF on the insertional 
mutagen is a selectable marker and/or reporter gene. Any IRES can be used to express 
20 an ORF located on the insertional mutagen. Examples of useful IRESs and methods of 
measuring IRES activity are known m the art (see, for example, Zhou et al, PNAS 
P5:1531-1536 (2001), Owens et al, PNAS PP:1471-1476 (2001); Venkatesan et al, Mol 
Cell Biol 8: 2826-2837 (2001); Jackson a/.. Trends Biochem. Set 15: 477-483 (1990); 
and Jang et al, J. Virol 62:2636-2643 (1988); each incorporated herein by reference for 
25 teaching IRES sequences and methods for measiiring IRES activity). 
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In certain other embodiments of the invention, the insertional mutagens can 
additionally or alternatively comprise one or more transposon signals. 1. Cui et al., J 
Mol Biol. 2002 May 17;3 18(5): 1221-35; 2. Izsvak et al, J Biol Chem. 2002 Jun 24; 3. 
Dupuy et al., Proc Natl Acad Sci USA. 2002 Apr 2;99(7):4495-9; 4. Horie et al., Proc 
5 Natl Acad Sci USA. 2001 Jul 31;98(16):9191-6; 5. Dupuy et al.,Genesis. 2001 
Jun;30(2):82-8; 6. Fischer et al., Proc Natl Acad Sci USA. 2001 Jun 5;98(12):6759-64; 
7. Ivies et al.,Cell. 1997 Nov 14;91(4):501-10. Other transposons also function in 
mammals: 8. Zagoraiou et al., Proc Natl Acad Sci USA. 2001 Sep 25;98(20):1 1474-8; 
9. Sherman et al., Nat Biotechnol. 1998 Nov;16(ll):1050-3; 10. Kawakami et al., Proc 
10 Natl Acad Sci USA. 2000 Oct 10;97(21):1 1403-8; 11. Fadool et al., Proc Natl Acad Sci 
USA. 1998 Apr 28;95(9):5 182-6; 12. Plasterk RH, Cell. 1993 Sep 10;74(5):781-6; 13: 
Kaufinan et al.,Nucleic Acids Res. 1991 Nov 25;19(22):6336; 14. Rubin et al., Nucleic 
Acids Res. 1983 Sep 24;11(18):6341-51; 15. Spradling et al.. Science. 1982 Oct 
22;218(4570):341-7; Ac and Ds and other plant transposons transpose, integrate and are 
15 used as insertional mutagens in plants: 16. Grevelding et al, Proc Natl Acad Sci USA. 
1992 Jul l;89(13):6085-9; 17. Walbot V., Curr Opin Plant Biol. 2000 Apr;3(2): 103-7; 
18. Pereira et al. Methods Mol Biol. 1998;82:329-38; 19. Cooley et al., Mol Gen Genet. 
1996 Aug 27;252(l-2): 184-94; 20. Bhatt et al, Plant J. 1996 Jun;9(6):935-45. P element 
transposon can integrate broadly throughout genomes: 21. Kassis et al., Proc Natl Acad 
20 Sci USA. 1992 Mar l;89(5):1919-23; 22. Berg et al.. Genetics. 1991 Mar;127(3):515- 
24; 23. Tower et al.. Genetics. 1993 Feb;133(2):347-59; 24. Cooley et al.. Prog Nucleic 
Acid Res Mol Biol. 1989;36:99-109; 25. Cooley et al.. Science. 1988 Mar 
4-239(4844):l 121-8; 26. Spradling et al., Proc Natl Acad Sci USA. 1995 Nov 
21;92(24):10824-30. 
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Transposon signals allow the insertional mutagens to insert into the DNA by 
expressing or otherwise introducing transposase in the cell with the insertional mutagen. 
In a preferred embodiment, the insertional mutagen is first introduced into DNA in a 
cell, and subsequently transposed, or "hopped," in order to insertionally mutate one or 
5 more genes. This can be done in vitro, in vivo, and in situ. Methods for transposing 
vectors in situ are well known in the art (see, for example, Lucklow etal, J. Virol 
57:4566-4579 (1993); Ivies et al, Cell P7:501-510 (1997); and Luo et aU PNAS 
P5: 10769-1 0773 (1998); each incorporated herein by reference for teaching vectors and 
methods of transposition thereof). 

10 There are many transposon systems and transposon signals known in the art that 

are useful in the present invention. These include TY from yeast (e.g., TYl; see Devine 
and Boeke Nucl Acids Res, 22:3165-3112 (1994), which is incorporated by reference 
herein in its entirety), P-elements, Hobo, Minos, and Manner from Drosophila, Tn5, 
Tn7, TO, TnlO, and Mu from bacteria, En/Spm from maize, and TCI/Mariner (and its 

15 derivatives, such as Sleeping Beauty) and Minos from mouse and C. elegans. Many 
other transposon systems are known in the art and would be recognized by the artisan as 
usefiil in the present invention. In addition, modified transposon signals and mutant 
transposases with enhanced efficiency have been described and would be useful in the 
present invention. 

20 Any of the insertional mutagens described herein for insertion can be produced 

as viral vectors, such as retroviral vectors (including lentivirus). Herpesviruses vectors 
(such as Epstein-Barr virus, cytomegalovims (CMV), Herpes zoster, and Herpes 
simplex), papillomavirus, picomavirus, papovavirus (such as polyoma vectors and 
SV40), adenovims, adeno-associated virus, and hepatitis virus. Particularly preferred 
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vectors are retroviral. Viruses have the advaatage of efficiently introducing the 
insertional mutagens into a cell, and, in the case of some viruses, facilitating efficient 
delivery of the insertional mutagen to the cell, and integration of the insertional mutagen 
into DNA in a cell. 

5 Retroviral vectors of the invention and of use in the methods of the invention can 

contain retroviral LTRs, packaging signals, and any other sequences that facilitate 
creation of infectious retroviral vectors. Retroviral LTRs and packaging signals allow 
the insertional mutagens of the invention to be packaged into infectious particles and 
delivered to the cell by viral infection. Methods for making recombinant retroviral 
10 vectors are well known in the art (see for example, Brenner et al, PNAS 55:5517-5512 
(1989); Xiong et al, Developmental Dynamics 272:181-197 (1998) and references 
therein; each incorporated herein by reference). In preferred embodiments, the retroviral 
vectors used in the invention are reverse orientation vectors, meaning that the gene 
mutation element in the insertional mutagen is in the opposite direction of viral 
15 transcription. The retroviral vectors can also be Self Inactivating viruses (SIN viruses). 
SIN viruses are not transcriptionally active upon insertion. Methods for making SIN 
viruses are known in the art (see for example, Hawley et al, PNAS 5¥:2406-2410 
(1987); Brenner et al, PNAS 5<5:5517-5512 (1989); and Lih et al., Gene 164:289-294 
(1995); each incorporated herein by reference for teaching vectors). Retroviral LTRs 
20 and packaging signals can be selected according to the intended host cell to be infected. 
Examples of retroviral sequences usefial in the present invention include those derived 
from Murine Moloney Leukemia Virus (MMLV), Avian Leukemia Virus (ALV), Avian 
Sarcoma Leukosis Virus (ASLV), Feline Leukemia Virus (FLV), and Human 
Inmiunodeficiency Virus (HIV). Other viruses known in the art are also useftil in the 
25 present invention and therefore will be famihar to the ordinarily skilled artisan. 
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In certain other embodiments, the insertional mutagens can additionally or 
alternatively comprise one or more bacterial sequences useful for propagating the 
insertional mutagens in prokaryotic or eukaryotic cells. Thus, the insertional mutagens 
of the invention can contain, for example, one or more antibiotic resistance markers, 
5 and/or one or more other art known sequences useful for propagating and analyzing the 
insertional mutagens. 

Any of the insertional mutagens described herein can further or alternatively 
comprise one or more 3* gene trap. A number of 3* gene traps have been described and 
are useful in the present invention (see e.g., Harrington et aL, WO 99/15650; 

10 Zambrowicz et al, U.S. Patent No. 6,080,576; Sands et al, U.S. Patent No. 6,136,566; 
Niwa et al, J, Biochem, 775:343-349 (1993); Yoshida et al, Transgenic Research 
^:277-287 (1995); all incorporated herein by reference in its entirety for teaching 3' gene 
traps). The 3' gene trap can be used to recover exons in the target DNA that are 
downstream of the insertional mutagen insertion site. Optionally, it can also be used to 

15 activate RNA or protein expression from genes in the target DNA. Wlien a 3' gene trap 
cassette is contained in a insertional mutagen of the present invention, it can be located 
upstream or downstream of a mutagenic portion of the insertional mutagen. In preferred 
embodiments, the 3' gene trap is located 3' of the mutagenic portion of the insertional 
mutagen. 

20 Further, any of the insertional mutagens described herein can further or 

altematively comprise one or more 5' gene trap cassettes. A number of 5' gene traps 
have been described and are useful in the present invention (see e.g., Zambrowicz et aL, 
U.S. Patent No. 6,080,576; Sands et aL, U.S. Patent No. 6,136,566; Niwa et al, J. 
Biochem. 113:343-349 (1993); Yoshida et al, Transgenic Research ^:277-287 (1995); 
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all incorporated herein by reference in its entirety for teaching 3' gene traps). The 5' 
gene trap can be used to recover exons in the target DNA that are downstream of the 
insertional mutagen insertion site. Optionally, it can also be used to activate RNA or 
protein expression from genes in the target DNA. When a 5' gene trap cassette is 
5 contained in a insertional mutagen of the present invention, it can be located upstream or 
downstream of a mutagenic portion of the insertional mutagen. In preferred 
embodiments, the 5' gene trap is located 5* of the mutagenic portion of the insertional 
mutagen. 

Integration of the Insertional Mutagens 

10 The insertional mutagens of the invention can be introduced into a cell and 

integrated into DNA by any method known in the art. In a preferred embodiment, they 
are introduced by transfection. Methods of transfection include, but are not limited to, - 
electroporation, particle bombardment, calcium phosphate precipitation, lipid-mediated 
transfection (e.g., using cationic lipids), micro-injection, DEAE-mediated transfection, 

15 polybrene mediated transfection, naked DNA uptake, and receptor mediated 
endocj^osis. 

In another preferred embodiment, the insertional mutagens are introduced by 
viral transduction or infection. Suitable viral vectors useful in the present invention 
include, but are not limited to, adeno-associated virus, adenovirus vectors, alpha- 
20 herpesvirus vectors, pseudorabies virus vectors, herpes simplex virus vectors and 
retroviral vectors (including lentivural vectors). Methods for making and using viral 
vectors are described above and elsewhere herein, and are well-known in the art and 
therefore familiar to the ordinarily skilled artisan (see, for example. Viral Vectors: Gene 
Therapy and Neuroscience Applications E. Caplitt and Loewy, Academic Press, San 
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Diego (1995); incorporated herein by reference for teaching viral vectors and methods of 
using such vectors for introducing and expressing polynucleotides of interest). 

In a preferred such embodiment, the vectors are retroviral vectors (including 
lentiviral vectors) and are introduced into the cell by infection. Vectors containing viral 
5 LTRs and packaging signals are described above. Methods for packaging retroviral 
vectors are also known in the art and can be used in the present invention (see, for 
example, U.S. Patent No. 5,449,614, the disclosure of which is incorporated herein by 
reference in its entirety for teaching vectors, packaging cell lines, and methods of 
making and packaging viral vectors). 

10 Following induction of an endogenous insertional mutagen or its introduction 

into a cell by transfection or infection, the insertional mutagens of the mvention 
integrate into the genome of the cell. The insertional mutagen can integrate into the 
target DNA by any method including, but not limited to, non-homologous recombination 
including retroviral insertion and transposition, site-specific recombination, homologous 

15 recombination and the like. In certain preferred embodiments, the insertional mutagen 
integrates by non-homologous recombination (e.g., integration by DNA end-joining, 
retroviral insertion, or transposition). 

In certain preferred embodiments of the invention, the cell can be treated with 
one or more DNA-breaking agents prior to, during, or following induction or 
20 introduction of the insertional mutagen into the cell. DNA-breaking agents increase the 
efficiency of integration. Examples of DNA-breaking agents suitable for use in 
accordance with this aspect of the invention include, but are not limited to, y-radiation. 
X-ray irradiation, UV irradiation, bleomycin, peroxides, and restriction enzymes. Other 
agents known to break DNA in living cells can also be used. Methods of using DNA 
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breaking agents to enhance insertional mutagen insertion have been described (see, e,g,, 
Harrmgton et al, WO 99/15650, incorporated herein by reference for teaching methods 
of enhancing nonhomologous recombination). 

In one embodiment, the initial integration is not the mutagenic event. Where the 
5 insertional mutagen contains a transposition signal, after the initial integration, the 
methods of the invention can be carried out by inducing an endogenous insertional 
mutagen to transpose ("hop") to a new location, where the mutagenic event(s) can occur. 
Methods for transposing vectors in situ are well known in the art and therefore will be 
familiar to the ordinarily skilled artisan (see, for example, Ivies et al, Cell Pi:501-510 
10 (1997); and Luo et al, PNAS P5: 10769- 10773 (1998); each incorporated herein by 
reference for teaching transposition vectors and methods). 

A variety of genes can be mutated using the methods of the invention. For 
example, known genes, including disease-causing genes (e.g., oncogenes, integrated 
viral genes (including HIV), genes causing genetic abnormalities such as cancers, 

15 multiple sclerosis, Alzheimer's disease, diabetes, muscular dystrophy, ALS, Gaucher' s 
Disease, Tay-Sachs disease, hemophilia, p-thalassemia, cystic fibrosis, sickle cell trait, 
and the like) and normal genes (imparting any phenotype to a cell or organism) can also 
be mutated using the methods of the invention. In another embodiment, genes which 
have been previously unknown or incompletely characterized can be mutated using the 

20 methods of the invention. In another embodiment, genes can be mutated that are known 
or characterized but which were not known to be correlated to a desired phenotype 
produced by the mutagenesis methods of the invention. The invention thus provides a 
way to identify, isolate and characterize previously vinknown or incompletely 
characterized genes in a variety of eukaryotic cells, and to examine the phenotypic 
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importance of such genes by examining the effects on cellular phenotype when the genes 
are mutated. 

It is also possible to have multiple (more than one per cell) insertional mutagens 
in each target cell to increase the probability that at least one gene will be mutated in the 

5 cell. Thus, cells created by the present methods can contain one or more integrated 
insertional mutagen. In certain embodiments, each of the target cells will contain 1-10 
insertional mutagens, or 10 or more insertional mutagens. Two or more insertional 
mutagens in a single cell has the advantage of reducing the total number of cells that 
must be screened to identify a cell with a mutation of a desired gene or of a gene(s) that 

1 0 causes a desired phenotype. 

The number of insertional mutations that would be useful depends on the size of 
the genome of the host cell, its ploidy, gene structure, the average insertion window 
associated with gene mutation (e.g,, size of the gene), the amount of genome coverage 
that is desired (e.g., the percentage of genes that are to be insertionally mutated), the 

15 propensity for integration of the insertional mutagen into genes, and the number of genes 
capable of producing a desired phenotype when mutated. In higher eukaryotic 
organisms, the genome is typically large. In mouse and human cells, for example, the 
haploid genome is estimated to be 3 x 10^ base pairs (6 biUion basepairs for the diploid 
genome). Therefore, 10^ insertions will result in an insertional mutagenic event 

20 frequency of 1 insertion per 3,000 base pairs. Assimiing that an average human gene is 
approximately 25,000 base pairs, and assuming random insertion, 10^ insertions at a 
frequency of 1 in 3,000 base pairs will, on average, result in the mutation of one copy of 
each gene at least once. In practice, it can be necessary to create a larger or smaller 
number of insertions if integration is foimd not to be random. For example, it can be 
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necessary to produce 10^ or more random insertions to insertionally mutate the majority 
of genes at least once. It is also useful in certain situations or for certain applications to 
create fewer insertions. In preferred embodiments, an insertion library containing at 
least 10,000 insertions is created. In highly preferred embodiments, an insertion library 
5 containing at least 100,000 insertions is created. In the situation where the desire is to 
focus mutagenesis on those genes that can be transcriptionally active in the cell under 
study, then use of insertional mutagens that show a bias for insertion into active genes 
(e.g., retroviruses or transposons) can enable the creation of libraries of lO'^-lO^ 
members that contain knockouts of all active genes. 

10 Since genome sizes vary, it is possible to rely on general guidelines for 

determining the number of insertions necessary to create a library of a given complexity. 
As a general guideline it is useful to create 1 insertion per 1000 to 10,000 base pairs of 
the host cell genome. In one embodiment, 1 insertion is created per 30,000 base pairs of 
host cell genome. In highly preferred embodiments, the insertion frequency is adjusted 

15 to create at least one mutation per gene in the library. It should be understood that these 
are general guidelines and that other insertion frequencies are possible and recognized 
by those skilled in the art. 

In embodiments where the insertional mutagen contains a selectable marker or 
reporter gene, cells can be selected for integration by selecting for expression of the 
20 selectable marker. If the selectable marker is expressed from a promoter on the 
insertional mutagen (see, for example. Figure 14), then any cell containing an integrated 
insertional mutagen should be recovered. If the selectable marker is not expressed from 
a promoter on the insertional mutagen, but is expressed from an upstream promoter on 
the target DNA (see Figures 2-6 for examples), then any cell containing the insertional 
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mutagen integrated into a transcriptionally active gene can be selected for or against 
depending on whether a positive or negative selectable marker, respectively, is being 
used. Integration into transcriptionally active genes is desirable because in instances 
where the cells are going to be screened for a phenotype, mutation of a transcriptionally 
5 active gene is more likely to give a phenotype than mutation of a transcriptionally 
inactive gene. Altematively, selection against a transcriptionally active gene can be 
useful for removing cells that have insertionally mutated a transcriptionally active gene 
so that other genes can be studied. For example, after removing cells containing 
mutated transcriptionally active genes, cells can be treated with agents that cause a 
10 change in gene expression within the cell and the artisan can now screen for phenotypes 
that result from mutation of genes that were previously silent but became active. 
Reporter genes can also optionally be used to screen for integration into transcriptionally 
active or silent genes. 

Library Construction 

15 The two mutagenesis methods can be carried out on one cell. The cell can then 

be screened for mutation of a specific desired gene or for a desired phenotype that 
results from mutation of one or more genes. 

In one embodiment, insertional mutagenesis is carried out on a single 
physicochemically mutated clone of cells. For example, a single physicochemically 
20 mutated cell can be expanded and used for insertional mutagenesis. This creates a 
library of cells that are mutagenized by both methods. Typically, for mammalian cells, 
between 1 and 10^ or more gene trap insertions are created for each physicochemically 
mutated clone. A gene trap insertion is an instance where genomic integration of the 
insertional mutagen has become operably linked to a constitutively or conditionally 
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active endogenous gene such that exonic sequences in the insertional mutagen are 
incorporated into the transcripts of the endogenous gene. Many of these gene trap 
insertions will disrupt the normal function of the insertionally mutagenized genes. In 
some embodiments this range would be to 10^ to 10"^ insertions per mutated clone. In 
5 certain preferred embodiments, the number of insertion mutants created for each 
physicochemically mutated clone ranges between 10"^ and 10^, and often between 10"^ 
and 10^ In one embodiment, after the insertional mutagenesis, the resulting library can 
be screened for integration of one or more of the insertional mutagens. In another 
embodiment, the resulting library is screened for mutation of a specific gene of interest 

10 or for a phenotype that results from mutation of a gene of interest, or for a phenotype 
that results from mutation of a gene that has not been previously identified or not 
previously known to be correlated with the phenotype. The mutant cells produced by 
these methods of the invention also can be cloned. The mutant cells that exhibit the 
phenotype derived from the combined mutagenesis can be cloned, for example to 

1 5 generate a purified population for further manipulation or use. 

In another embodiment, insertional mutagenesis is carried out on more than one 
physicochemically mutated clone of cells. For example, more than one 
physicochemically mutated cell can be expanded for insertional mutagenesis. The 
number of physicochemically mutated clones can range from 1 to 10^ or more. In 

20 prefeixed embodiments, the nimiber of physicochemically mutated clones used to create 
the insertion mutants is between 1 and 100,000 and more preferably between 1 and 
10,000. In highly preferred embodiments, the number of physicochemically mutated 
clones used to create the insertion mutants is between 1 and 1000. Usefiil ranges include 
approximately 2, 5, 10, 25, 50, 100, 200, 400, 600, and 800 clones. Guidelines for 

25 assessing physicochemical mutation frequency and selecting appropriate numbers of 
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mutated cells are discussed above in the section describing physicochemical 
mutagenesis. The selected number of physicochemically mutated cells can be expanded 
and used to create a library of cells that are mutagenized by the two methods. Typically, 
for mammalian cells, between 1 and 10^ or more insertions are created for each 
5 physicochemically mutated clone present. In preferred embodiments, the number of 
insertion mutants created for each physicochemically mutated clone ranges between 10"^ 
and 10^, and often between lO"^ and 10^ For example, without limitation, if 100 
physicochemically mutated clones are used to create a library, then a library with 10"^ 
insertions per clone would result in the creation of a library with 10^ clones (100 
10 physicochemically mutated clones x lO"^ insertionally mutated clones). In one 
embodiment, the resulting library can be screened for integration of one or more 
insertional mutagens after the insertional mutagenesis. In another embodiment the 
resulting library can be screened for mutation of a gene of interest, for a phenotype that 
results from mutation of a gene of interest, or for a phenotype that results fi-om a 
15 mutation of a gene not previously identified or not known to be correlated with tlie 
phenotype. The mutant cells produced by these methods of the invention also can be 
cloned. 

It is also possible to make the library by insertional mutagenesis and then use 
that library for physicochemical mutagenesis. In one embodiment, one or more cells is 

20 mutated by integration of one or more insertional mutagens. Typically, in mammalian 
cells, the insertional mutagen is inserted into 1 to 10^ or more cells. In preferred 
embodiments, the insertional mutagen is inserted into between 10"* to 10^ cells, and often 
into between lO"^ to 10^ cells. Cells contaimng the integrated insertional mutagens are 
then physicochemically mutated. With either procedure, in one embodiment, clones are 

25 expanded prior to physicochemical mutagenesis. Preferably, between 50 and 10,000 

-90- 



wo 03/035834 PCT/US02/33714 
genes are mutated in each physicochemically mutated cell. In certain embodiments, 
approximately 1,000 genes are mutated in each target cell. Once created, the 
physicochemically/insertionally mutated library of cells can be screened for mutation of 
a gene of interest or for a phenotype that results from mutation of a gene of interest, or 
5 that results from mutation of a gene that has not been previously identified or known to 
be correlated to the phenotype. The mutant cells produced by these methods of the 
invention also can be cloned. 

Library Screening 

Libraries of mutant cells can be screened for mutation of a desired gene. Gene 
10 expression levels or gene product activity could be assayed or another phenotype that is 
associated specifically with the desired gene could be assayed. The assays can be used 
to identify cells with reduced or missing gene expression or function or with increased 
or restored gene expression or function. The tag is still useful in this embodiment. It 
can be used to verify that the mutation is in the desired gene or to ascertain if the desired 
15 gene is improperly expressed because of a mutation in a separate gene. The tag could 
also be usefiil as a way to isolate the mutated cell or clone of cells from a large number 
of cells when there is no assay that is sufficiently sensitive. 

Examples of usefiil assays to detect a desired gene include, but are not limited to, 
ELISA, ELISA spot assays, PGR (e.g., rtPGR), transcription reporter assays, westem 
20 blot, northern blots, Southem blots, electrophoretic mobility shift assays, transcriptional 
profiling (e.g., using gene chips), enzyme assays (e.g., protease, kinase, phosphatase, 
hydrolase, and other known assays), ligand binding assays, and Fluorescence Activated 
Gell Sorting (FAGS®) and magnetic bead cell sorting. 
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Librajdes of mutant cells produced by the present invention can also be screened 
for desired phenotypes. Cells displaying the desired phenotype can then be used to 
identify one or more mutant genes responsible for the phenotype where the mutagenic 
polynucleotide comprises a tag that tags the mutated gene. This approach can be used, 
5 for example, to identify mutant genes that play a role in a cellular or biochemical 
process. By such methods of the invention, changes in a variety of cellular phenotypes 
that may be associated with genetic mutations may be analyzed, including without 
limitation: cell proliferation, cell transformation, cell migration, cell differentiation, 
signal transduction, cell morphology, cell transport, protein degradation, apoptosis, 
10 chemoresistance, chemosensitivity, inflammatory response, nuclear translocation of 
proteins, protein secretion, cellular activation, gene activation, protein expression, 
receptor activation, and metastasis. See also the previous list above. Many other assays 
are known in the art that the ordinarily skilled artisan would recognize as useful in the 
present invention. 

15 The methods of the present invention can also be used in screens for the presence 

of conditional mutations in cells or organisms. Conditional mutations allow a mutation 
in a given gene to remain silent until a phenotypic screen (often dependent upon 
expression of the gene) is performed. This approach is particularly advantageous in 
situations where, for example, a mutation is toxic to the host cell, creates a slow-groAvth 

20 or no-growth phenotype, kills the cell, induces terminal differentiation, or is otherwise 
deleterious to the cell. Examples of such conditional mutations that may be used in 
accordance with, or that may be detected by, the methods of the present invention 
include but are not limited to temperature sensitive mutations (heat- or cold-sensitive 
mutations), sensitivity to chemicals such as dimethylsulfoxide, site-specific 

25 recombination in vitro or in vivo, translation read-through, and the like. 
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Uses of Mutated Cells 

Once a cell with a desired phenotype is identified, the mutated gene can be 
identified via the tag present on its protein or mRNA or by analyzing the genomic 
integration site of the insertional mutagen, as discussed below. Methods for isolating the 

5 tagged gene include, but are not limited to, 5' RACE, inverse PCR, and cDNA library 
construction and hybridization. Methods for cloning genes that have been mutated or 
activated are known in the art (see for example, Harrington et al, U.S. Patent 
Application No. 09/276,820 filed March 26, 1999; Zambrowicz et al, U.S, Patent No. 
6,080,576; Sands et al, U.S. Patent No. 6,136,566; Niwa et al, J. Biochem, 773:343- 

10 349 (1993); Yoshida et al, Transgenic Research ^:277-287 (1995); Baker et al, Dev. 
Biol / 55:20 1-2 14 (1997); each incorporated herein by reference for teaching methods of 
identifying genes mutated by the mutagenic polynucleotides). 

The present invention can also be used to discover novel drugs and drug targets 
for use in diagnosing, treating or preventing a variety of diseases and physical disorders. 

15 For example, cDNA molecules and genomic fragments containing mutated genes of 
interest can be used to produce a gene product in vitro or in a cell or animal, to screen 
drugs, develop new diagnostic methods or assays related to the genotype or phenotype 
of interest, or to express proteins for therapeutic use {e.g., the gene may encode a protein 
such as erythropoietin, that can be administered to patients to treat a condition). The 

20 mutant gene or gene product also can be used to identify the corresponding wild-type 
gene or gene product. The wild-type gene can be used to produce a wild-type gene 
product in vitro or in a cell or animal, to screen drugs, to develop new diagnostic 
methods or assays related to the genotype or phenotype of interest, or to express proteins 
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for therapeutic use (e.g,, the gene may encode a protein such as erythropoietin that can 
be administered to patients to treat a condition). 

Mutated cells made using the present invention can be used in drug screening. 
For example, mutated cells displaying a therapeutically relevant genotype or phenotype 
5 can be isolated from a library of mutated cells. Once isolated, the mutated cells can be 
exposed to test compounds to identify compoxmds that inhibit, further stimulate, or 
otherwise modulate the genotype or phenotype of interest. By carrying out this process, 
drugs and/or drug leads can be identified. Examples of phenotypes relevant to drug 
screening include, but are not limited to, apoptosis, cell proliferation, chemosensitivity, 
10 chemotherapeutic resistance, cell migration, cell activation (e.g., T cell activation), cell 
transformation, metastasis, cellular differentiation, signal transduction, transcriptional 
activation, protein expression, protein degradation, protein secretion, and other 
phenotypes known in the art that will be readily apparent to the ordinarily skilled artisan. 

Mutated cells prepared according to the present invention can also be used for 
15 manufacturing or other commercial purposes. For example, cells of the invention 
displaying high growth rates, high protein expression levels, high levels of protein 
secretion, optimized post-translational modification of expressed proteins, ability to 
grow in serum-free or other defined or inexpensive culture media, etc., offer an 
advantage in commercial applications such as in manufacturing proteins, foods, 
20 beverages, therapeutics, etc. 

Mutated cells prepared according to the methods of the present invention can 
also be used to study gene function in vivo. In one embodiment, cells mutated by the 
present invention are mtroduced into an animal by adoptive transfer. Cells displaying a 
desired phenotype in the animal can then be recovered and isolated. Alternatively, 
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mutated cells that display a desired phenotype in culture can be introduced into an 
animal by adoptive transfer to study the in vivo phenotype of the cell. Examples of in 
vivo assays include, but are not limited to, txmior formation, metastasis, graft versus host 
disease, autoimmune disease, transplant rejection, reconstitution of missing or non- 
5 functional cell types (e.g., bone marrow transplantation), cell differentiation, and other 
assays known in the art. Methods for introducing cells into an animal by adoptive 
transfer are well known in the art (see, for example. Roth et al, J Exp Biol 200:2057- 
2062 (1997); Mosier Adv Immunol 50:303-325 (1991); Mule et al, J. Immimother. 
72:196-198 (1992); each incorporated herein by reference for teaching methods and uses 
10 of adoptive transfer). Optionally, the mutated gene can be identified from the mutated 
cell. 

In another embodiment, mutated cells (e.g., somatic or germ cells, embryonic 
stem cells or adult multipotential stem cells) can be used to create a transgenic animal. 
Methods for making transgenic animals from embryonic stem cells are well known in 

15 the art (see for example, Jackson and Abbott (2000) Mouse Genetics and Transgenics, 
Oxford University Press, pgs. 266-284; and Hogan, Beddington, Costantim, and Lacy 
(1994) Manipulating the Mouse Embryo, Cold Spring Harbor Press, all pages; Joyner, 
Bioessays 75:649-656 (1991); each reference incorporated herein by reference for 
teaching methods of producing transgenic animals from stem cells). Similarly, methods 

20 for making transgenic animals from somatic or germ cells are well known in the art (see, 
e,g„ U.S. Patent Nos. 5,322,775, 5,366,894, 5,476,995, 5,650,503 and 5,861,299, all of 
which are incorporated herein by reference in their entireties for teaching methods of 
producing transgenic animals from mutated or genetically manipulated somatic or germ 
cells). One such method is nuclear transfer cloning, in which the nucleus of a donor 

25 somatic cell is genetically modified {e.g,, using the mutational meliiods of the present 
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invention), and then the nucleus is removed from the donor cell and placed into a 
recipient cell (preferably, an oocyte) to produce a transgenic animal containing the 
genetic modifications from the donor nucleus. This process is well-known in the art and 
will be familiar to the ordinarily skilled artisan (see, e.g,, Campbell et al. Nature 
5 550:64-66 (1996); CibelH et al. Nature Biotech. 16:620-621 (1998); McCreath et al. 
Nature 405:1066-1069 (2000); Hochedlinger et al. Nature ¥75:1035-1038 (2002); 
Schnieke et al. Science 275:2130-2133 (1997); Kasinathan et al. Nature Biotech 
iP:1176-1178 (2001); Wolf et aL, Arch. Med Res, 32:609-613 (2001); the disclosures of 
all of which are incorporated herein by reference in their entireties). 

10 In one embodiment, transgenic animals that contain an insertional mutagen that 

is associated with a specific mutated gene can be prepared by, for example, 
physicochemically or insertionally mutating sperm cells in vivo and physicochemically 
or insertionally mutating oocytes in vitro or in vivo (e.g,, one or more lentiviral vectors), 
and then fertilizing the mutated oocytes with the mutated sperm cells to produce a 

15 homozygous mutant zygote. This zygote can then be implanted into a recipient female 
and carried to term, thereby producing a transgenic animal homozygous for one or more 
mutations. Other methods for producing transgenic animals are well-known in the art, 
and will be familiar to the ordinarily skilled artisan (see, e.g., WO 90/05188; Hammer, 
R.E., et al, J. Animal Set 53:269-278 (1986); Pursel, V.G., et al, J. Reprod. Pert. 

20 Suppl 40:235-245 (1995); Houdebine, L.-M., J. Biotechnol 34:269-287 (1994); 
Hammer, R.E., et al. Nature 375:680-683 (1985); Mortensen, R.M., et aL, Mol Cell 
Biol 73:2391-2395 (1992); Deng, C, etal , Cell 82:61 5-6M (1995); and Murakami, T., 
et al, Devel Gen. 70:393-401 (1989), the disclosures of all of which are incorporated 
herein by reference in their entireties). 
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Transgemc animals can be created in any eukaryotic organism. In preferred 
embodiments, the transgenic organism is a fly, a worm, a fish, or a mammaL In highly 
preferred embodiments, the organism is a human, a non-human primate, a mouse, a rat, a 
pig, a cow, a sheep, a dog, a cat, a bird, a zebrafish, C. elegans, or Drosophila, The 
5 transgenic animal can be used to carry out genetic screens for phenotypes of interest or 
for studying the function of individual genes. Examples of phenotypes include, but are 
not limited to, weight, height/length, organ histology, organ function, immune 
competency, blood chemistry {e,g., cholesterol levels, etc.), bone density and structure, 
gross morphology, and behavior. Additional phenotypic screens are known in the art 
10 and useful in the present invention (see for example Nolan et al, Nature Genetics 
25:440-443 (2000); incorporated herein by reference for teaching phenotypic screens). 

In another embodiment, multicellular organisms can be mutated directly by in 
vivo mutagenesis. An animal can be produced from a cell that is mutated in vitro. 
Mutagenesis can be physicochemical or insertional. The cell can be a stem cell 

15 (embryonic or adult), somatic cell, or germ cell. The mutation that is introduced into the 
animal this way could be a heterozygous mutation in a gene where a homozygous 
mutation is necessary to produce a phenotype in an organism or in a cell in an organism. 
The organism can be mutagenized directly to produce the homozygous mutation. A 
gene responsible for the phenotype can be identified by a tag in the cell used to make the 

20 animal or in a cell insertionally mutagenized in the intact animal. The mutation that is 
introduced may be part of a set of mutations (i.e., mutation in two or more different 
genes) that are all required to produce a phenotype in an organism or in a cell in an 
organism. The organism can be mutagenized directly to produce the other required 
mutations. 
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Alternatively, transgenic plants can also be produced according to the methods of 
the present invention. In such methods, one or more, or suitably two or more, genes or 
alleles in a plant cell is mutated according to the methods of the invention. Transgenic 
plants may then be prepared using tiiis mutated genomic DNA according to art-known 
5 plant genetic engineering techniques, such as nuclear transfer, transformation or 
protoplast fusion (see Hall, Robert D., Plant Cell Culture Protocols, Humana Press, 
New Jersey (1999); Gartland and Davey, Agrobacterium Protocols, Humana Press, New 
Jersey (1995); Kosuge et al. Gen. Eng, of Plants 26:5-25 (1983); Rogers et aL, in: 
Methods for Plant Molecular Biology, A. Weissbach and H. Weissbach, eds.. Academic 
10 Press, Inc., San Diego, CA (1988)). Such techniques are widely in use (see, e.g., 
Chaleff, R.S,, Genetics of Higher Plants: Applications of Cell Culture, Cambridge: 
Cambridge University Press (1981)), and newly inserted foreign genes have been shown 
to be stably maintained diiring plant regeneration and are transmitted to progeny as 
typical MendeUan traits (Horsch et al. Science 223:496 (1984), and DeBlock et al, 
15 EMBO 5:1681 (1984)). These foreign genes retain their normal tissue specific and 
developmental expression pattems. The Agrobacterium tumefaciens-mGdiatQd 
transformation system has also proved to be efficient for transfer of genetic material, 
including many dicotyledonous plant species (Barton et al. Cell 52:1033(1983); Chang 
et al, Planta 5:551-55^ (1994)) and monocotyledonous plants, e.g., in plants in the 
20 Liliaceae and Amaryllidaceae families (Hooykaas-Van Slogteren et al , Nature 311 :763- 
764 (1984)) and in Dioscorea bulbifera (yam) (Schafer et al. Nature 527:529-532 
(1987)). 
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Identification of conditional mutations 

Conditional mutations that produce phenotypes only after imposing specific 
experimentally controlled conditions can also be generated using the procedxires 
described in this application. These conditional mutations can be useful in enabling 
5 more detailed investigation of gene function and access to a wider range of phenotypes 
that become possible as a consequence of the inherent ability to precisely control the 
timing and degree of function of conditionally mutant gene products. Examples of 
conditional mutations include the creation of cold or heat sensitive mutant cells or 
organisms that exhibit the loss of mutant gene function and the consequent appearance 
10 of mutant phenotypes only under conditions of depressed or elevated temperature, 
respectively (references 1-6), or the creation of chemically destabilized alleles that 
depend upon, for example, DMSO exposure, to imcover the altered function of the 
mutant alleles (reference 7). 

The examples described above identify alleles that are conditional upon changes 
15 in the environment of the cell or organism. Other alleles that are conditional upon 
changes that are intrinsic to the mutated cell can also be identified. In this embodiment, 
experimentally controlled changes in the activity of components of the cell or organism 
would alter the function of conditionally mutant gene products, and this regulation of 
mutant gene function would serve to also regulate the appearance of the mutant 
20 phenotype. For example, Hsp90 and other chaperonm proteins have been shown to be 
required to maintain the active conformation of many marginally stable proteins 
including proteins that contain destabilizing sequence changes as a result of mutation 
(reference 8-11). Engineering cells to express Hsp90 only under the regulation of an 
inducible promoter (for example by using tetracycline, ecdysone, or other inducible 
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promoter systems to control Hsp90 gene expression), or treating cells with chemicals 
that abbrogate Hsp90 function would generate cells in which many mutant proteins 
could be destabilized and their loss of function phenotypes revealed by experimentally 
controlled manipulation of Hsp90 activity (references 8-11). Thus the expression of 
5 mutant phenotypes would become dependent upon the experimentally induced reduction 
in Hsp90 or other chaperonin activity. Hsp90 activity might also be manipulated, for 
example, by creating cells that express their only Hsp90 protein as a fusion protein 
consistmg of a steroid hormone binding (or other regvilatory) domain fused to Hsp90 
protein. Such a regulatory domain-chaperonin fusion protein would only exhibit 
10 chaperone active when bound with the appropriate steroid hormone (or other regulatory 
Ugand; references 12-15) and conditional mutations and phenotypes would depend upon 
the concentration of the regulatory hgand in these cells. 1 : Tasaka SE, Suzuki DT. 
Genetics. 1973 Jul;74(3):509-20; 2: Suzuki DT. Science. 1970 Nov 13;170(959):695- 
706; 3: Suzuki DT, Pitemick LK, Hayashi S, TarasoffM, Baillie D, Erasmus U. Proc 
15 Natl Acad Sci USA. 1967 Apr;57(4):907-12; 4: Pringle JR. Methods Cell Biol. 
1975;12:233-72; 5: Basilico C. Adv Cancer Res. 1977;24:223-66; 6: Meiss HK, 
Talavera A, Nishimoto T. Somatic Cell Genet. 1978 Jan;4(l): 125-30; 7: Poloni D, 
Simanis V. FEBS Lett. 2002 Jan 30;511(l-3):85-9; 8: Morimoto RI, Kline MP, Bimston 
DN, Cotto JJ. Biochem. 1997 32: 17-29; 9: Jakob U, Lilie H, Meyer I, Buchner J. J. 
20 Biol. Chem. 1995 270:7288-94; 10: Rutherford SL, Lindquist S. Nature. 1998 Nov 
26;396(6709):336-42; 11: Queitsch C, Sangster TA, Lindquist S. Nature. 2002 Jun 
6;417(6889):6 18-24; 12: Angrand PO, Woodroofe CP, Buchholz F, Stewart AF. 
Nucleic Acids Res. 1998 Jul l;26(13):3263-9; 13: Tada M, O'Reilly MA, Smith JC. 
Development. 1997 Jun;124(ll):2225-34; 14: Takebayashi H, Oida H, Fujisawa K, 
25 Yamaguchi M, Hikida T, Ftikumoto M, Narumiya S, BCakizuka A. Cancer Res. 1996 Sep 
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15;56(18):4164-70; 15: Metzger D, Clifford J, Chiba H, Chambon P. Proc Natl Acad 
Sci USA. 1995 Jul 18;92(15):6991-5. 

It will be imderstood by one of ordinary skill in the relevant arts that other 
suitable modifications and adaptations to the methods, compositions and applications 

5 described herein are readily apparent from the description of the invention contained 
herein in view of information known to the ordinarily skilled artisan, and can be made 
without departing firom the scope of the invention or any embodiment thereof. Having 
now described the present invention in detail, the same will be more clearly imderstood 
by reference to the following examples, which are included herewith for purposes of 

1 0 illustration only and are not intended to be limiting of the invention. 

EXAMPLES 

EXAMPLE 1: Combined Mutagenesis of Embryonic Stem Cells 

Embryonic stem cells (RI) (Nagy et al, Proc. Natl. Acad. Sci. USA P0:8424- 
8428 (1990)) are cultured on feeder cells or gelatin coated plates at 37EC in a 

15 humidified incubator with 7.5% CO2. The cells are cultured in D-MEM/15% FBS/0.1 
mM p-mercaptoethanol/Leukemia Inhibitory Factor (1000 U/ml)/non-essential amino 
acids. The cells are grown to 80% confluence. In 10 plates, each containing 
approximately 4x10^ cells, O^-benzylguanine (O^-BG) is diluted in DMSO and added 
to the media to 10 |j,M fmal concentration. To a separate 10 plates, no O^-BG is added. 

20 Following a 12-16 hour incubation, cells fi-om each plates are trypsinized, diluted to 5 x 
10^ cells/ml, and incubated with various concentrations of ENU for 1 or 2 hours. ENU 
concentrations were 0.2 mg/ml, 0.3 mg/ml, 0.35 mg/ml, and 0.4 mg/ml. The cells that 
are pretreated with O^-BG were incubated with O^-BG during the ENU treatment. 
Following treatment, cells firom each ENU treatinent are washed, trypsinized to 
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dissociate aggregates, counted, and plated in duplicate into gelatin coated tissue culture 
plates at between 100, 1000, 10,000, and 100,000, cells per plate and grown under 
standard conditions described above. The 0^-BG treated cells are grown in the presence 
of O^-BG (10 \xM) for 24 hours, followed by growth in standard media. Colonies are 
5 counted after 10 days. In the dupHcate plates, 6-thioguanine (6-TG) is added at 48 hours 
post-plating to select for HPRT-nuU mutations in order to determme the mutation 
frequency produced by each treatment condition. The number of 6-TG resistant colonies 
divided by the number of colonies present on the duplicate (non-6TG selected) plate 
determines the mutation frequency for a single copy gene. 

10 In an example of the methods of the invention, a plate containing approximately 

1000 clones is selected from an ENU/O^-BG treatment condition that give a mutation 
frequency of 1 in 1000. The cells are expanded to 5 x 10^ cells (50% confluence) under 
standard culture conditions described above. The cells are then washed and incubated 
with pKO-1 retroviral vector at an MOI = 1 for 24 hours. The structure of pKO-1 is 

15 shown in Figure 3B. After 24 hours, the cells are washed and replated in dupUcate into 
fresh media at 10% confluence. One of the duplicate libraries is grown to 40-50% 
confluence and placed tinder G418 selection (250 |Lig/ml). The other duplicate library is 
grown to confluence, trypsinized, frozen in 90%FBS/10%DMSO at 10^ cells/ml, and 
stored in liquid nitrogen. Cells grown under G-418 selection are groAvn, without 

20 replating, for 10-12 days with several media changes. G418 resistant clones are 
counted, trypsinized, and expanded. The library can then be stored in liquid nitrogen or 
used for phenotypic screening. 
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EXAMPLE 2: Combined Mutagenesis of Jxirkat Cells 

Cellular screen to identify gene knockouts that cause resistance to FasL-induced 

apoptosis in Jurkat cells 

5 The combined mutagenesis technology described in this application was 

developed to speed the discovery of gene function in mammalian cell culture by 
coupling distinct mutagenesis techniques to efficiently generate phenotypes and 
simultaneously enable the facile identification of gene mutations that cause these 
phenotypes. combined mutagenesis combines the strengths of various mutagenic 
10 approaches to compensate for distinct deficiencies of each of the individual mutagenesis 
techniques. The embodiment of combined mutagenesis that is described here uses the 
following sequence of steps to identify cell based gene function (illustrated in Figure 1): 

• high density ENU mutagenesis of cells in cultxxre; creates compact libraries of 
cells that contain mutations in nearly all genes 

15 • select only those ENU mutagenized clones for further gene trap mutagenesis that 

do not already display the phenotype 

• gene trap mutagenesis of selected ENU clones 

• selection for gene trap mutagenized clones that now display the phenotype 

• test to ensure the phenotype reverts to wild type upon Cre-mediated removal of 
20 the gene trap 

• sequence identification of the insertionally mutated gene in Cre-revertible gene 
trap clones 

• siRNA confirmation that specific gene knockouts are sufficient to confer 
phenotype 
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Efficient creation of phenotypes and identification of mutated genes derives from 
combining the high efficiency of chemical mutagenesis with the ability of gene trap 
insertional mutagens to provide a sequence tag for identification of the mutated genes 

5 (Figure 15). Chemical mutagens are highly efficient means to generate high densities of 
mutations in mammalian cells, and the size of libraries that contain mutations hi all 
genes can be less than 50 cells. However, the identification of the alleles that cause 
specific phenotypes in these chemically mutated cell lines is extremely difficult. In 
contrast, while mutagenesis by gene trap vectors is much less efficient, these gene trap 

10 mutagens compensate by enabling the easy identification of the genes that are 
responsible for a given phenotype by means of the physical linkage between the 
mutagenized gene and sequences in the insertional mutagen. Cells showing phenotypes 
created by combined mutagenesis carry at least one chemically induced mutation and at 
least one gene trap mutation whose mutual uiteraction is needed to generate the cellular 

15 phenotype (Figure 15). 

To test the combined mutagenesis technology we needed to 1) establish a high 
density chemical mutagenesis protocol, 2) construct an effective and reversible gene trap 
retrovirus, 3) create a combined mutagenesis mutagenized library with these reagents, 
and 4) conduct a proof-of-principle screen of this library to identify cells that have 

20 acquired resistance to Fas-induced apoptosis as a result of the combined mutagenesis. 
The development and testing of combined mutagenesis is described below. 

The ability to identify all genetic mutations that can create a phenotype requires 
that large numbers of mutations be created in as few cells as possible, and tlie high 
efficiency of ENU chemical mutagenesis is suited to this task. We set out to determine 

25 the ENU treatment needed to create 50-100 cell libraries in which all genes are mutated 
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at least once (defined as a Ix knockout library) in Jurkat cells. To characterize the 
mutagenesis we measured both cell survival and the mutation rate in characterizing the 
optimal ENU treatment (Table I). Survival was determined by limiting dilution, and the 
mutation rate was estimated in the population of surviving clones based on the frequency 
5 of knocking out the single copy HPRT gene (assayed by the survival of such HPRT" 
cells upon culture in the presence of 6-thioguanine). As shown in the table below, at an 
ENU dose of 0.2mg/ml the mutation rate is 0.02 HPRT genes/cell and the Ix library 
size is therefore -50 cells. Creation of libraries of these densely mutagenized cells 
provides a genetically sensitized background in which small increments of further 
10 mutagenesis can create phenotypes by knocking out the sole remaining wild type alleles 
of ENU-mutated genes. 



Table I: 



ENU dose 
(mg/ml) 


% survival 


mutation rate 


1x library size* 


0 


98.40% 


3.9x1 0'^(-7) 


2.56x10^6 


0.1 


1.83% 


2.4x1 0A(-3) 


417 


0.125 


0.93% 


4.9x1 0'^C-S) 


204 


0.15 


1.08% 


e.ixio'^C-a) 


164 


0.175 


0.88% 


8.2x1 0'^C-S) 


122 






2.oxio^{-2)' : 


50 


0.25 


0.07% 


3.3x1 0^(-2) 


31 


0.45 


7x10'^(-5) 


NA 


NA 



15 



mutations. Chemical mutagenesis protocol: 

Jurkat cells were treated with lOuM 0-6-BG (Sigma) at 5x10^5 cells/ml for 16 
hours at 37°C in a hijmidified incubator with 5% C02. The pretreated cells were 
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then incubated with desired concentration of ENU (Sigma) in the presence of O- 



6-BG for 2 hours at 37°C with constant shaking. At the end of incubation cells 



were washed with 10-volume IxPBS three times and resuspended in complete 



media with lOuM 0-6-BG. 24 hours later 0-6-BG was washed off, cells were 



5 



resuspended in complete media and seeded onto 96-well plate at 20 cells per well 
or 400-2000 cells per well. Cells were tiien cultured for 10 days. Viability is 



10 



determined by the lower density seeding based on number of wells containing 
growing colonies. To determine mutation jfrequency, cells seeded at the higher 
density were used to select for Hprt-loss of function mutants in the presence of 
40uM 6-TG (Sigma). Number of wells containing growing colonies in the 



presence of 6-TG was scored 7-10 days later. Mutation frequency is calculated as 
# of 6-TG resistant colony divided by # of viable cells seeded. 
Similarly mutated libraries were created by treating Jurkat cells with 0.7mg/ml 
EMS (Sigma) following above protocol A. Ix library size contamed about 100 cells 
1 5 under this condition. 

Karyotype of ENU mutagenized Jurkat cells: 

In some preferred embodiments, cells subjected to mutation are diploid. 
Therefore we determined the chromosome coimt of Jurkat cells and ENU 
mutagenized Jurkat cells by Giemsa stain. -75% of ENU mutagenized Jurkat 
20 cells are diploid or hypodiploid. This is comparable with wild type Jurkat cells. 

In order to prepare the ENU libraries for gene trap mutagenesis, the surviving Jurkat 
clones were then tested to ensure that they remained susceptible to FasL induced 
apoptosis after the 0.2mg/ml ENU mutagenesis; 100 of these Fas-sensitive clones were 
then chosen as targets for gene trap mutagenesis. This exercise demonstrates that we 
25 can create densely mutagenized libraries of cells that do not exhibit the phenotype of 
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interest even though they contain knockouts in one copy of 1-3% of the genes in each 
cell. This combination of dense mutagenesis with lack of phenotype is cracial since it 
creates small and tractable libraries of cells that possess genetic backgrounds in which 
single gene trap knockouts can create a phenotype by disrupting the remaining wild type 
5 copy of genes that determine phenotype. 

Creation of an efficient gene trap vector was needed to serve as an insertional 
knockout mutagen and tag of the insertionally mutated genes for the next phase 
of combined mutagenesis function discovery. The design of the gene trap 
retroviral vector pDK02 that was created for this purpose is shown below 
10 (Figure 16). This vector is designed to trap transcriptional active genes using the 

function of the splice acceptor in the vector. When the vector is integrated 
within a gene, splicing occurs from splice donors that exist at the end of exons in 
the trapped endogenous gene onto the splice acceptor that is provided by the 
vector. Once this splicing event occurs, a fusion transcript will be made that 
1 5 results in the introduction of vector encoded stop codons that cause the 

premature termination of the protein product of the trapped gene. Downstream 
IRES (internal ribosome entry site) activity in the fusion transcript enables the 
reinitiation of translation on the fusion transcript and this translation expresses a 
selectable drug resistance protein. Expression of drug resistance allows selection 
20 to identify the trapping of transcriptionally active genes. Lox signals flank the 

gene trap portion of pDK02, and Cre-mediated recombination of these lox sites 
results in the deletion of the pDK02 gene trap and knockout functions; this 
deletion should revert phenotypes that depend upon insertional mutagenesis and 
aid in the identification of biologically active gene traps, 

25 
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The function of the various components of pDK02 were separately tested to ensure 
activity as follows; 

• IRES: We confirmed that this IRES does function to reinitiate translation, and 
5 showed that it does not have promoter activity in pDK02 

• TK: expression of TK was shown to cause cell death in gancyclovir containing 
media 

• Cre/lox: transfection of ere expression plasmids into cells carrying the integrated 
pDK02 construct resulted in excision events occurring in -80% of cells 

10 • S/A-x-IRES-DR-bGHpA: the gene trap portion of pDK02 was introduced 

into Jurkat cells, followed by selection for drug resistance (neomycin 1.5mg/ml 
for Jurkat), drug resistant clones were harvested and the appearance of drug 
resistance was associated with splicing of endogenous transcripts onto the 
pDK02 splice acceptor in all cases examined by RT-PCR 

15 

Production of the pDK02 retroviral vector occvirred by transfecting pDK02 into 
RetroPack PT67 cells (Clontech) via ExgenSOO (MBI Fermentas). Individual 
' stable colonies were picked and selected for high titer producers. High titer virus 
soup was harvested and used to infect Jtirkat cells following spin-infection 
20 protocol. Briefly, 3x106 Jurkat cells were resuspended in 2ml complete media 

plus 1ml viral soup and polybrene at 8ug/ml. We spininfected at lOOOg for 1 
hour. Cells were then placed in 32C incubator overnight and then incubated at 
37C for 24 hours to allow integration and expression of retrovirus. Titer was 
determined by limiting dilution and found to be 1-3x104 per ml. 
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To assess gene trap efficiency of this vector, we assayed the frequency of gene 
trap events for several known genes by RT-PCR. Two pairs of nested gene specific 
primers (specific for DHFR, HPRT, FasR, and Casp8) were used in this analysis, and the 
resultant RT-PCR products were sequenced to confirm the identity of gene traps. All 

5 were confirmed to be true gene trap events with upstream exons firom the endogenous 
gene correctly spliced onto the splice acceptor in pDK02. hi most cases pDK02 was 
found to integrate within introns near the 5 '-end of the trapped gene. The data shown 
below suggest that the Ix library size for trapping a single allele is '-2x1 OM clones. It is 
easy to generate tihiis number of clones, and this efficiency permits the use of pDK02 to 

10 create genome wide gene trap knockout libraries in each of the Fas-sensitive ENU 
clones described above. 
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Gene trap frequency: 



Cell type 


# clones 
per pool 


# pools 
total 


# clones 
total 


# of pools in which one of the following 
genes is trapped 


HPRT 


DHFR 


FasR 


Casp8 


Jurkat 


7500 


10 


7.5x10m 


2 


3 


3 


9 



Characterization of trapped genes: 



gene name 


HPRT 


DHFR 


FasR 


Casp8 


# traps 


2 


3 


3 


9 


# alleles/cell 


1 


2 


2 


2 


gene size 


40kb 


30kb 


26kb 


55kb 



5 For 75,000 clones 

(2+3+3+9)/(l+2+2+2)=2.4 hits per allele 
for a diploid gene: 4.8 hits per 75,000 clones 
average size for assayed genes: (40+3 0+26+5 5)/4=38kb 
for average gene size of 28kb, Ix library size: 2xlOM 

10 

The next step is to combine these elements to implement a combined 
mutagenesis screen to identify gene functions. Clones from the ENU Jurkat library were 
infected with the pDK02 gene trap retrovirus to create a combined mutagenesis 
knockout library in Jurkat cells (Figure 17). One hundred ENU mutagenized Jurkat 
15 clones (2x genome coverage for ENU mutagen) that had been tested to ensure 
unchanged sensitivity to Fas-induced apoptosis were chosen for this pDK02 retrovirus 
infection and ~10M drug resistant clones (0.5x genome coverage for pDK02 mutagen) 
were obtained for each ENU clone. The final combined mutagenesis library therefore 



-110- 



wo 03/035834 PCT/US02/33714 

consists of -one million clones and this number represents ~lx coverage of the range of 
possible human gene disruptions in these cells. This coverage is sufficient to identify 
the majority (but not all) of the mutations that decrease FasL induced apoptosis in Jurkat 
cells. Under above conditions, the spontaneous Fas resistance rate is ~1 in 2-3x10^ 

5 

Mutation screen: 

ENU clones were first tested for FasAb sensitivity. Only Fas-sensitive clones 
were carried forward for the screen. 100 ENU clones (derived j&om 0.2mg/ml ENU 
treatment, table A.l.) were expanded (~2x library coverage) to 5-10x106 cells per clone. 

1 0 Each ENU clone was infected with pDK02-neo retrovirus to create a pool of cells where 
one allele of a gene can be potentially mutated by the pDK02 vector. About 104 drug 
resistant clones were obtained for each ENU clone, which is about 0.5x library coverage 
for gene trap event. Cells were split into duplicate, A and B, hnmediately after infection 
and were subsequently grown in neomycin containing media for 10-12 days to select for 

1 5 neomycin resistant clones and to deplete endogenovis wild type protein produced prior to 
the mutation of the gene. From each pool A or B, 0.5-1x106 cells (100-200 cells per 
clone) were taken into Fas screen following protocol in CI. Fas resistant clones were 
identified by the outgrowth of the cells diiring the first 2-3 weeks after Fas treatment. 

Fas resistant clones were subjected to re-test by Annexin V staining (Molecular 

20 Probes) and caspase 3 activity assay (Intergen) (Ref: Blood 84, 1415, 1994; J. Biol. 
Chem. 273, 32608-32613, 1998). -70% of the clones confirm the Fas resistant 
phenotype based on these two assays. To determine if the resistant phenotype is due to 
insertion of the pDK02 vector into the genome, ere expression plasmid was transfected 
into each clone and followed by gancyclovir selection to select for excision event. 
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Clones were then analyzed for reversion of the phenotype, thereby demonstrating that 
the trapped gene was responsible for the phenotype. 

FasL induced apoptosis is well studied in Jurkat cells, and modulators of this 
pathway have potential clinical relevance in cancer, stroke and other disease processes. 
5 Previous genetic and biochemical analysis many systems and laboratories have 
estabUshed the basic signal transduction path involved in initiating FasL induced 
apoptosis (Figure 18). We expect to identify combined mutagenesis gene trap knockouts 
of some of the members of the known Fas signaling pathway (Figure 18) as one result of 
this screen. Knockout of novel genes might also reveal previously unknown players 
10 important for regulating apoptosis, and these novel regulatory genes are of great interest 
as potential new targets for therapeutic intervention. 
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To identify gene traps that affect sensitivity to FasL, the doubly mutagenized 
combined mutagenesis library was selected for clones that exhibited resistance to Fas- 
induced apoptosis. Under the FasL selection conditions, there is a low spontaneous 
5 background of resistance that occurs at -1 in 2-3x10^ cells, and we recovered 35 Fas- 
resistant clones from the million clone library. The phenotype of each clone was further 
confirmed by showing that FasL treatment failed to activate the nonnal levels of 
Annexin V staining and Caspase 3 activity as molecular markers of apoptosis. These 
results show that combined mutagenesis can efficiently create reproducible phenotypes 

10 in mammalian cells. 

To determine which of the FasL-resistant cells have acquired their phenotype as 
a direct result of the insertion of the pDK02 vector into the genome, we used 
transfection of Cre expression pl^mids to induce the excision of pDK02 gene trap 
sequences from each of the clones. Cre mediated recombination was followed by 

1 5 gancyclovir selection to detect deletion of the thymidine kinase element along with the 
gene trap functions in pDK02. Clones were then analyzed for reversion of the 
phenotype, and 13 out of the 35 clones showed phenotypic reversion after Cre-mediated 
removal of the gene trap (phenotypic reversion data shown in Figure 19). The cells that 
did not show phenotypic reversion after excision of the gene trap may have acquired 

20 resistance to FasL as the result of a spontaneous mutation, but since such events are 
nearly impossible to identify, these clones were not analyzed further. Thirteen clones 
did show FasL resistance that depended upon gene trap knockout, and these cells were 
further studied to identify the trapped genes and to characterize their role in apoptosis. 
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The 13 clones showing FasL resistance caused by the pDK02 gene trap were 
analyzed using RT-PCR, 5'-RACE and inverse PGR to identify the biologically 
active trapped genes. It has been demonstrated previously that mutation of 
caspase-8 gene leads to FasL resistance (Curr. Biol. 8(18), 1001-1008, 1998). To 
5 test whether caspase-8 has been mutated in any of the FasL resistant clones from 

our screen, we carried out RT-PCR assay with two nested caspase-8 specific 
forward primers and two nested vector specific reverse primers on those clones. 
We identified caspase-8 trap in 2 of the clones. 5 '-RACE and inverse PGR 
identified the trapped genes in most of the other Fas-resistant clones. Many of 
10 these genes had not been previously characterized as having an involvement in 

FasL induced apoptosis, and these genes are expected to play unexpected roles in 
apoptosis and as potential therapeutic targets. 



In summary, we have developed a loss of fimction genetics strategy that enables 
1 5 the discovery of in vivo gene fimction in mammalian cells, and have shown that this 
combined mutagenesis strategy permits the rapid identification of known and novel 
genes that perform these functions. Such a genetic approach should be valuable in the 
identification of gene fimction and in the discovery of new targets for therapeutic 
intervention. 

20 Having now fiiUy described tlie present invention in some detail by way of 

illustration and example for purposes of clarity of understanding, it will be obvious to 
one of ordinary skill in the art that the same can be performed by modifying or changing 
the invention within a wide and equivalent range of conditions, formulations and other 
parameters without affecting the scope of the invention or any specific embodiment 
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thereof, and that such modifications or changes are intended to be encompassed within 
the scope of the appended claims. 

All publications, patents and patent applications mentioned in this specification 
are indicative of the level of skill of those skilled in the art to which this invention 
5 pertains, and are herein incorporated by reference to the same extent as if each 
individual publication, patent or patent appUcation was specifically and individually 
indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 . A method for mutating a gene in a cell comprising: 

subjecting a cell to simultaneous physicochemical mutagenesis and 
insertional mutagenesis such that one allele of said gene is physicochemically mutated 
and the other allele is insertionally mutated. 

2. A method for mutating a gene in a cell comprising: 

subjecting a cell first to insertional mutagenesis and then to 
physicochemical mutagenesis such that one allele of said gene is physicochemically 
mutated and the other allele is insertionally mutated. 

3 . A method for mutating a gene in a cell comprising: 

subjecting a cell to physicochemical mutagenesis and insertional 
mutagenesis such that one allele of said gene is physicochemically mutated and the other 
allele is insertionally mutated. 

4. A method for mutating a gene in a cell and identifying the mutated gene 
comprising: 

subjecting a cell to simultaneous physicochemical mutagenesis and 
insertional mutagenesis such that one allele of said gene is physicochemically mutated 
and the other allele is insertionally mutated and identifying the gene by means of the 
insertional mutagen. 
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5. A method for mutating a gene in a cell and identifying the mutated gene 
comprising: 

subjecting a cell first to insertional mutagenesis and then to 
physicochemical mutagenesis such that one allele of said gene is physicochemically 
mutated and the other allele is insertionally mutated and identifying the gene by means 
of the insertional mutagen. 

6. A method for mutating a gene in a cell and identifying the mutated gene 
comprising: 

subjecting a cell to physicochemical mutagenesis and insertional 
mutagenesis such that one allele of said gene is physicochemically mutated and the other 
allele is insertionally mutated and identifying the gene by means of the insertional 
mutagen. 

7. A method for mutating two or more genes in a cell, wherein the 
mutations are required cxraiulatively to produce a phenotype, comprising: 

subjecting a cell to physicochemical mutagenesis and insertional 
mutagenesis such that said two or more genes are physicochemically mutated or 
msertionally mutated but at least one gene of the two or more genes is insertionally 
mutated. 

8. A method for mutating two or more genes in a cell and identifying at 
least one of the mutated genes, wherein the mutations are required cumulatively to 
produce a phenotype, comprising: 



-117- 



wo 03/035834 PCT/US02/33714 

subjecting a cell to physicochemical mutagenesis and insertional 
mutagenesis such that said two or more genes are physicochemically mutated or 
insertionally mutated but at least one gene of the two or more genes is insertionally 
mutated and identifying at least one insertionally mutated gene by means of the 
insertional mutagen. 

9. A method for correlating a phenotype of a cell wilh a mutated gene 
comprising: 

simultaneously physicochemically and insertionally mutagenizing a cell, 
screening the cell for a desired phenotype, and identifying the mutated gene that 
correlates with the phenotype by means of the insertional mutagen. 

10. A method for correlating a phenotype of a cell with a mutated gene 
comprising: 

physicochemically and insertionally mutagenizing a cell, screening the 
cell for a desired phenotype, and identifying the mutated gene that correlates with the 
phenotype by means of the insertional mutagen, wherein insertional mutagenesis 
precedes physico-chemical mutagenesis.. 

11. A method for correlating a phenotype of a cell with a mutated gene 
comprising: 

physicochemically and insertionally mutagenizing a cell, screening the 
cell for a desired phenotype, and identifying the mutated gene that correlates with the 
phenotype by means of the insertional mutagen. 
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12. A method for selecting a mutated cell from a plurality of cells 
comprising: subjecting a plurality of cells to physicochemical and insertional 
mutagenesis and selecting an insertionally mutated cell by means of the insertional 
mutagen. 

13. A method for creating a homozygous mutation in a multicellular 

organism comprising: 

subjecting a first organism to physicochemical mutagenesis, subjecting an 
organism of the opposite sex to the first organism to insertional mutagenesis, breeding 
the first and second mutated organisms, and obtaining progeny from the breeding 
wherein the progeny is homozygous for a mutation in a gene wherein one allele of the 
gene is mutated by physciochemical mutagenesis and the second allele of the gene is 
mutated by insertional mutagenesis. 

14. A method for mutating a gene in a cell, said cell containing a mutation in 
one allele of said gene induced by a physico-chemical mutagen, said method comprising 
subjecting said cell to an insertional mutagen such that the second allele of said gene is 
insertionally mutated. 

15. A method for mutating a gene in a cell, said cell containing a mutation in 
one allele of said gene induced by an insertional mutagen, said method comprising 
subjecting said cell to a physico-chemical mutagen such that the second allele of said 
gene is physico-chemically mutated. 
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16. A method fdr mutating two or more genes in a cell where said mutations 
are required cumulatively to produce a phenotype, said method comprising subjecting a 
cell to physico-chemical mutagenesis such that a sufficient number of genes is physico- 
chemically mutated to produce said phenotype, said cell containing an insertional 
mutation in at least one of said genes required to produce said phenotype. 

17. A method for mutating two or more genes in a cell and identifying at 
least one of the mutated genes, where said mutations are required cumulatively to 
produce a phenotype, said metiiod comprising subjecting a cell to physico-chemical 
mutagenesis such that a sufficient number of genes is physico-chemically mutated to 
produce said phenotype, said cell containing an insertional mutation in at least one of 
said genes required to produce said phenotype, and identifying at least one insertionally- 
mutated gene by means of the insertional mutagen. 

18. The method of claim 1-17, wherein said physicochemical mutagen is 
selected from the group consisting of UV irradiation, gamma irradiation, x-rays, a 
restriction enzyme, a mutagenic or teratogenic chemical, a DNA repair mutagen, a DNA 
repair inhibitor, an error-prone DNA replication protein, N-elhyl-N-nitrosourea (ENU), 
ethylmeliianesulphonate (EMS) and ICR191. 

19. The method of claim 18, wherein said error-prone DNA replication 
protein is an error-prone DNA polymerase. 

20. The method of claim 18, wherein said physicochemical mutagen is 
selected ftom the group consisting of ENU and EMS. 
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21. The method of claim 1-17, wherein said insertional mutagen comprises 
one or more physical or functional nucleotide sequences selected from the group 
consisting of one or more multiple cloning sites, one or more transcription termination 
sites, one or more transcriptional regulatory sequences, one or more translational signal 
sequences, one or more open reading frames (ORFs), one or more sequences mutating 
ORFs, one or more stop codons, one or more sequences mutating or eliminating stop 
codons, one or more mRNA destabilizing elements, one or more hairpin sequences, one 
or more sequences mutating or eliminating hairpins, one or more reporter genes, one or 
more splice acceptor sequences, one or more splice donor sequences, one or more 
intemal ribosome entry sites (IRES), one or more transposon sequences, one or more 
site-specific recombination site sequences, one or more restriction enzyme sequences, 
one or more nucleotide sequences encoding a fusion partner protein or peptide, one or 
more selectable markers or selection modules, one or more bacterial sequences useful 
for propagating said vector in a host cell, one or more 3* gene traps, one or more 5' gene 
traps, one or more nucleotide sequences encoding localization signals, one or more- 
origins of replication, one or more protease cleavage sites, one or more desired proteins 
or peptides encoded by a gene or a portion of a gene, and one or more sequences 
encoding one or more 5' or 3' polynucleotide tails. 

22. The method of claim 21, wherein said transcriptional regulatory 
sequences are selected from tlie group consisting of promoters, enhancers and 
repressors. 
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23. The method of claim 21, wherein said one or more 5' or 3' polynucleotide 
tails are one or more poly (A) tails. 



24. The method of claim 21, wherein said insertional mutagen comprises at 
least one stop codon. 

25. The method of claim 21, wherein said insertional mutagen comprises at 
least one splice acceptor site. 

26. The method of claim 21 , wherein said vector comprises at least one splice 
donor site. 

27. The method of claim 25, wherein upon integration of said insertional 
mutagen into the genome of said ceil, said spUce acceptor site is operably linked to a 
splice donor site of said gene. 

28. The method of claim 26, wherein upon integration of said insertional 
mutagen into the genome of said cell, said splice donor site is operably linked to a splice 
acceptor site of said endogenous gene. 

29. The method of claim 21, wherein said insertional mutagen comprises at 
least one 5' gene trap. 

30. The method of claim 21, wherein said insertional mutagen comprises at 
least one selectable marker. 
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31. The method of claim 21, wherein said insertional mutagen comprises at 
least one reporter gene. 

32. The method of claim 31, wherein said reporter gene is selected from the 
group consisting of P-lactamase, p-galactosidase, luciferase, chloramphenicol acetyl 
transferase, green fluorescent protein and its derivatives, yellow fluorescent protein and 
its derivatives, blue fluorescent protein and its derivatives, and red fluorescent protein 
and its derivatives. 

33. The method of any of claims 3, 6, 7, 8, 10, 11, 12, wherein said cell is 
contacted with said physicochemical mutagen and said insertional mutagen 
simultaneously. 

34. The method of any of claims 3, 6, 7, 8, 11, 12, 14-17, wherein said cell is 
contacted with said physicochemical mutagen and said mutagenic insertional mutagen 
sequentially. 

35. The method of claim 34, wherein said cell is contacted with said 
physicochemical mutagen before being contacted wifli said msertional mutagen. 

36. The method of claim 34, wherein said cell is contacted with said 
physicochemical mutagen after being contacted with said insertional mutagen. 
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37. The method of any of claims 1-17, further comprising contacting said cell 
more than once with said physicochemical mutagens and/or more than once with said 
insertional mutagens. 

38. The method of claim 37, wherein said additional mutagenic treatments 
comprise contacting said cell with one or more physicochemical mutagens. 

39. The method of claim 37, wherein said additional mutagenic treatments 
comprise contacting said cell with one or more insertional mutagens. 

40. Tlie method of any of claims 1-17, wherein said insertional mutagen is 
contained within a virus or transposon. 

41. The method of claim 40, wherein said virus is selected from the group 
consisting of an adenovirus, an adeno-associated virus, a retrovirus, an alpha- 
herpesvirus, a lentivirus, a pseudorabies virus and a herpes simplex virus. 

42. The method of claim 41 , wherein said virus is a retrovirus. 

43 . The method of claim 42, wherein said retrovirus is a lentivirus. 

44. A cell in vitro where one allele of a gene in said cell is insertionally 
mutated and the second allele of said gene is physicochemically mutated. 
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45. A cell containing a mutation in two or more genes where at least one of 
the mutations is an insertional mutation, wherein the mutations are cumulatively 
required to produce a phenotype in said cell. 

46. The cell of claim 44 or 45 wherein said cell is an embryonic stem cell or 
germ cell. 

47. The cell of claim 44 or 45, wherein said cell has been treated with at least 
one physicochemical mutagen selected from the group consisting of UV irradiation, 
gamma irradiation, x-rays, a restriction enzyme, a mutagenic or teratogenic chemical, a 
DNA repair mutagen, a DNA repair inhibitor, an error-prone DNA repUcation proteui, 
ENU, EMS and ICR191. 

48. The cell of claim 47, wherein said error-prone DNA replication protein is 
an error-prone DNA polymerase. 

49. The cell of claim 47, wherein said cell has been treated with at least one 
physicochemical mutagen selected from the group consisting of ENU and EMS. 

50. The cell of claim 44 or 45, wherein said insertional mutagen comprises 
one or more physical or functional nucleotide sequences selected from the group 
consisting of one or more multiple cloning sites, one or more transcription termmation 
sites, one or more transcriptional regulatory sequences, one or more translational signal 
sequences, one or more ORFs, one or more sequences mutating ORFs, one or more stop 
codons, one or more sequences mutating or eluninating stop codons, one or more mRNA 
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destabilizing elements, one or more hairpin sequences, one or more sequences mutating 
or eliminating hairpins, one or more reporter genes, one or more splice acceptor 
sequences, one or more splice donor sequences, one or more IRES, one or more 
transposon sequences, one or more site-specific recombination site sequences, one or 
more restriction enzyme sequences, one or more nucleotide sequences encoding a fusion 
partner protein or peptide, one or more selectable markers or selection modules, one or 
more bacterial sequences useful for propagating said vector in a host cell, one or more 3* 
gene traps, one or more 5' gene traps, one or more nucleotide sequences encoding 
localization signals, one or more origins of replication, one or more protease cleavage 
sites, one or more desired proteins or peptides encoded by a gene or a portion of a gene, 
and one or more sequences encoding one or more 5* or 3' polynucleotide tails. 

51. The cell of claim 50, wherein said transcriptional regulatory sequences 
are selected from the group consisting of promoters, enhancers and repressors. 

52. The cell of claim 50, wherein said one or more 5' or 3* polynucleotide 
tails are one or more poly (A) tails. 

53. The cell of claim 44, wherein said insertional mutagen comprises at least 
one stop codon. 

54. The cell of claim 44, wherein said insertional mutagen comprises at least 
one splice acceptor site. 
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55. The cell of claim 44, wherein said insertional mutagen comprises at least 
one splice donor site. 



56. The cell of claim 54, wherein upon integration of said insertional 
mutagen into the genome of said cell, said splice acceptor site is operably linked to a 
splice donor site of said gene. 

57. The cell of claim 55, wherein upon integration of said insertional 
mutagen into the genome of said cell, said splice donor site is operably linked to a spUce 
acceptor site of said gene. 

58. The cell of claim 44, wherein said insertional mutagen comprises at least 
one 5' gene trap cassette. 

59. The cell of claim 44, wherein said insertional mutagen comprises at least 
one selectable marker. 

60. The cell of claim 44, wherein said insertional mutagen comprises at least 
one reporter gene. 

61. The cell of claim 60, wherein said reporter gene is selected from the 
group consisting of p-lactamase, p-galactosidase, luciferase, chloramphenicol acetyl 
transferase, green fluorescent protein and its derivatives, yellow fluorescent protein and 
its derivatives, blue fluorescent protein and its derivatives, and red fluorescent protein 
and its derivatives. 
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62. The cell of claim 44, wherein said cell has been subjected to more than 
one physicochemical and/or insertional mutagenesis treatments. 

63. The cell of claim 62, wherein said one or more additional mutagenic 
treatments comprise contacting said cell with one or more physicochemical mutagens. 

64. The cell of claim 62, wherein said one or more additional mutagenic 
treatments comprise contactmg said cell with one or more insertional mutagens. 

65. The cell of claim 44, wherein said insertional mutagen is contained 
within a virus. 

66. The cell of claim 65, wherem said virus is selected from the group 
consisting of an adenovims, an adeno-associated virus, a retrovirus, an alpha-, 
herpesvirus, a lentivirus, a pseudorabies virus and a herpes simplex vims. 

67. The cell of claim 66, wherein said vnus is a retrovirus. 

68. The cell of claim 67, wherein said retrovims is a lentivirus. 

69. The cell of claim 44, wherein said cell is selected from the group 
consisting of a human cell, a non-human primate cell, a mouse cell, a rat cell, an avian 
cell, a bovine cell, a porcine cell, an ovine cell, a canine cell, a feline cell, a plant cell, a 
fish cell, an insect cell and a worm cell. 
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70. A library of two or more physicochemically mutated cells, one or more of 
which cells has been contacted with one or more insertional mutagens, wherein one or 
more insertional mutagens has been inserted into the genome of at least one cell within 
said library. 

71 . A library of one or more insertionally mutated cells each comprising one 
or more insertional mutagens inserted into the genome of said cell, wherein one or more 
cells from the library has been treated with one or more physicochemical mutagens. 

72. The library of claim 70 or claim 71, wherem said one or more insertional 
mutagens each comprises one or more physical or functional nucleotide sequences 
selected from the group consisting of one or more multiple cloning sites, one or more 
transcription termination sites, one or more transcriptional regulatory sequences, one or 
more translational signal sequences, one or more open reading frames (ORFs), one or 
more sequences mutating ORFs, one or more stop codons, one or more sequences 
mutatmg or eliminating stop codons, one or more mRNA destabilizing elements, one or 
more hairpin sequences, one or more sequences mutating or eliminating hairpins, one or 
more reporter genes, one or more splice acceptor sequences, one or more splice donor 
sequences, one or more intemal ribosome entry sites (IRES), one or more transposon 
sequences, one or more site-specific recombination site sequences, one or more 
restriction en2yme sequences, one or more nucleotide sequences encoding a fiision 
partner protein or peptide, one or more selectable markers or selection modules, one or 
more bacterial sequences useful for propagating said vector in a host cell, one or more 3' 
gene traps, one or more 5' gene traps, one or more nucleotide sequences encoding 



-129- 



wo 03/035834 PCT/US02/33714 
localization signals, one or more origins of replication, one or more protease cleavage 
sites, one or more desired proteins or peptides encoded by a gene or a portion of a gene, 
and one or more sequences encoding one or more 5' or 3' polynucleotide tails. 

73 . The library of claim 72, wherein said transcriptional regulatory sequences 
are selected from the group consisting of promoters, enhancers and repressors. 

74. The library of claim 72, wherein said one or more 5' or 3' polynucleotide 
tails are one or more poly (A) tails. 

75. The library of claim 70 or claun 71, wherein said one or more insertional 
mutagens each comprises at least one stop codon. 

76. The library of claim 70 or claim 71, wherein said one or more insertional 
mutagens each comprises at least one splice acceptor site. 

77. The library of claim 70 or claim 71, wherein said one or more insertional 
mutagens each comprises at least one splice donor site. 

78. The library of claim 76, wherein upon introduction of said insertional 
mutagen into the genome of said cell, said splice acceptor site is operably linked to a 
splice donor site of said gene. 
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79. The library of claim 77, wherein upon introduction of said insertional 
mutagen into the genome of said cell, said splice donor site is operably linked to a splice 
acceptor site of said gene. 

80. The library of claim 70 or claim 71, wherein said one or more insertional 
mutagens each comprises at least one 5' gene trap. 

81 . The Ubrary of claim 70 or claim 71, wherein said one or more insertional 
mutagens each comprises at least one selectable marker. 

82. The library of claim 70 or claim 71, wherein said one or more insertional 
mutagens each comprises at least one reporter gene. 

83. The library of claim 82, wherein said reporter gene is selected from the 
group consisting of p-lactamase, p-galactosidase, luciferase, chloramphenicol acetyl 
transferase, green fluorescent protein and its derivatives, yellow fluorescent protein and 
its derivatives, blue fluorescent protein and its derivatives, and red fluorescent protein 
and its derivatives. 

84. The library of claim 70 or claim 71, wherein said physicochemical 
mutagen is selected from the group consisting of UV irradiation, gamma irradiation, 
x-rays, a restriction enzyme, a mutagenic or teratogenic chemical, a DNA repair 
mutagen, a DNA repair inhibitor, an error-prone DNA replication protein, ENU, EMS 
andICR191. 
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85. The library of claim 81, wherein said error-prone DNA repUcation 
protein is an error-prone DNA polymerase. 



86. The library of claim 67 or claim 68, wherein said physicochemical 
mutagen is selected from the group consisting of ENU and EMS. 

87. The library of claim 67 or claim 68, wherein said library comprises at 
least 50 physicochemically mutated clones and at least 1000 insertionally mutated 
clones. 

88. The method of any of claims 1-12, 14-17 wherein said cell is in vitro. 

89. The method of any of claims 1-12, 14-17 wherein said cell is in vivo. 

90. The method of any of claims 1-12, 14-17 wherein said cell is an isolated, 

cell. 

91 . The method of any of claims 1-17, wherem said insertional mutagenesis 
is accomplished by a method selected from the group consisting of transposon insertion 
and retrovirus insertion. 

92. The method of claim 91, wherein said msertional mutagenesis is 
accomplished by retrovirus insertion. 
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93. The method of claim 91, wherein said retrovirus insertion is 
accomplished using a lentivirus. 



94. The method of any of claims 1-17, wherein said cell is a mammalian cell. 

95. The method of claim 94, wherein said mammalian cell is selected from 
the group consisting of a hmnan cell, a mouse cell, a rat cell, a rabbit cell, a bovine cell, 
an ovine cell, a porcine cell, a canine cell and a feline cell. 

96. The method of any of claims 1-17, wherein said cell is an embryonic 
stem cell. 

97. A method for making a transgenic organism containing a cell 
homozygous for a mutation in at least one gene, said method comprising: 

subjecting a cell to insertional mutagenesis in vitro so as to produce a 
mutated cell comprising a mutation in a first allele of a gene in the genome of said cell; 

subjecting said cell to non-insertional mutagenesis to induce a mutation 
in a corresponding second allele in the genome of said cell; and 

producing a transgenic organism from said cell, wherein said transgenic 
organism is homo2ygous for a mutation in said gene. 

98. A method for making a transgenic organism containing a cell with a 
mutation in two or more genes in a cell , the mutations required cumulatively to produce 
a phenotype in said cell or in said organism, said method comprising: 



-133- 



wo 03/035834 PCT/US02/33714 

subjecting a cell to insertional mutagenesis in vitro so as to produce a 
mutated cell comprising a mutation in one or more genes in the genome of said cell; 

subjecting said cell to non-insertional mutagenesis to induce a mutation 
in one or more genes in the genome of said cell; and 

producing a transgenic organism from said cell, wherein said transgenic 
organism is mutated in two or more genes in a cell. 

99. The method of claim 97 or claim 98, wherein said insertional 
mutagenesis is accomplished using at least one insertional mutagen under conditions 
such that said insertional mutagen is introduced into the genome of said donor cell, 
thereby mutating one allele. 

100. The method of claim 99, wherein said insertional mutagen comprises one 
or more physical or functional nucleotide sequences selected from the group consisting 
of one or more multiple cloning sites, one or more transcription termination sites, one or 
more transcriptional regulatory sequences, one or more translational signal sequences, 
one or more open reading frames (ORFs), one or more sequences mutating ORFs, one or 
more stop codons, one or more sequences mutating or eliminating stop codons, one or 
more mRNA destabilizing elements, one or more hairpin sequences, one or more 
sequences mutating or elraiinating hairpins, one or more reporter genes, one or more 
splice acceptor sequences, one or more splice donor sequences, one or more intemal 
ribosome entry sites (IRES), one or more transposon sequences, one or more site- 
specific recombination site sequences, one or more restriction enzyme sequences, one or 
more nucleotide sequences encoding a fusion partner protein or peptide, one or more 
selectable markers or selection modules, one or more bacterial sequences useful for 
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propagating said vector in a host cell, one or more 3^ gene trap cassettes, one or more 
nucleotide sequences encoding localization signals, one or more origins of replication, 
one or more protease cleavage sites, one or more desired proteins or peptides encoded by 
a gene or a portion of a gene, and one or more sequences encoding one or more 5' or 3' 
polynucleotide tails. 

101. The method of claim 100, wherein said transcriptional regulatory 
sequences are selected from the group consisting of promoters, enhancers and 
repressors. 

102. The method of claim 101, wherein said one or more 5* or 3' 
polynucleotide tails are one or more poly (A) tails. 

103. The method of claim 101, wherein said msertional mutagen comprises at 
least one stop codon. 

104. The method of claim 101, wherein said insertional mutagen comprises at 
least one splice acceptor site. 

105. The method of claim 101, wherein said insertional mutagen comprises at 
least one splice donor site. 

106. The method of claim 104, wherein upon mtroduction of said insertional 
mutagen into the genome of said cell, said spUce acceptor site is operably linked to a 
splice donor site of said gene. 



-135- 



wo 03/035834 



PCT/US02/33714 



107. The method of claim 105, wherein upon introduction of said insertional 
mutagen into the genome of said cell, said spUce donor site is operably hnked to a spUce 
acceptor site of said gene. 

108. The method of claim 99, wherein said insertional mutagen comprises at 
least one 3' gene trap cassette. 

109. The method of claim 99, wherein said insertional mutagen comprises at 
least one selectable marker. 

110. The method of claim 99, wherein said insertional mutagen comprises at 
least one reporter gene. 

111. The method of claim 110, wherein said reporter gene is selected from the 
group consisting of p-lactamase, p-galactosidase, luciferase, chloramphenicol acetyl 
transferase, green fluorescent protein and its derivatives, yellow fluorescent protein and 
its derivatives, blue fluorescent protein and its derivatives, and red fluorescent protein 
and its derivatives. 

112. The method of claim 97 or claim 98, wherein said non-insertional 
mutagenesis is accompUshed using at least one physicochemical mutagen. 

113. The method of claim 112, wherein said physicochemical mutagen is 
selected from the group consisting of UV irradiation, gamma irradiation, x-rays, a 
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restriction enzyme, a mutagenic or teratogenic chemical, a DNA repair mutagen, a DNA 
repair inhibitor, an error-prone DNA r^lication protein, ENU, EMS and ICR191. 



114. The metibod of claim 113, wherein said error-prone DNA replication 
protein is an error-prone DNA polymerase. 

115. The method of claim 112, wherein said physicochemical mutagen is 
selected from the group consisting of ENU and EMS. 

116. The method of claim 98 or claim 99, wherein said insertional 
mutagenesis is accompHshed by a method selected from the group consisting of 
retroviral insertion and transposon insertion. 

117. The method of claim 98 or claim 99, wherein said insertional 
mutagenesis is accomplished by retroviral insertion. 

118. The method of claim 117, wherein said retroviral insertion is 
accomplished using a lentivirus. 

119. The method of claim 97 or claim 98, wherein said cell is a mammalian 

cell. 

1 20. The method of claim 97 or claim 98, wherein said cell is a plant cell. 

121 . The method of claim 97 or claim 98, wherein said cell is a yeast cell. 
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122. The method of claim 97 or claim 98, wherein said cell is an embryonic 
stem cell. 

123. The method of claim 1 17, wherein said mammalian cell is selected from 
the group consisting of a hxmian cell, a non-human primate cell, a mouse cell, a rat cell, 
a rabbit cell, a bovine cell, an ovine cell, a porcine cell, a canine cell and a feline cell. 

124. The method of claim 97 or claim 98, wherein said transgenic organism is 
selected from the group consisting of a transgenic non-human mammal, a transgenic 
bird, a transgenic fish, a transgenic worm, a transgenic insect, a transgenic plant and a 
transgenic yeast. 

125. The method of claim 124, wherein said transgenic non-human mammal is 
selected from the group consisting of a transgenic mouse, a transgenic rat, a transgenic 
rabbit, a transgenic cow, a transgenic sheep, a transgenic pig, a transgenic dog and a 
transgenic cat. 

126. The method of claim 124, wherein said transgenic fish is a transgenic 
zebrafish. 

127. The method of claim 124, wherein said transgenic worm is a transgenic 

C elegans. 
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128. The method of claim 124, wherein said transgenic msect is selected from 
the group consisting of a transgenic Drosophila, a transgenic Spodoptera and a 
transgenic Trichoplusa. 

129. The method of claim 124, wherein said transgenic plant is a transgenic 
dicotyledenous plant. 

130. The method of claim 124, wherein said transgenic plant is a transgenic 
monocotyledenous plant. 

131. A method for making an organism comprising at least one cell, wherein 
said cell is homozygous for a mutation in at least one gene, said method comprising 
obtaining the cell of claim 49 and introducing said cell into an organism. 

132. The method of claim 131, wherein said cell is a mammalian cell. 

133. The method of claim 131, wherein said cell is aplant cell. 

134. The method of claim 131, wherein said cell is a yeast cell. 

135. The method of claim 131, wherein said cell is an embryonic stem cell. 

136. The method of claim 132, wherein said mammalian cell is selected from 
the group consisting of a himian cell, a non-human primate cell, a mouse cell, a rat cell, 
a rabbit cell, a bovine cell, an ovine cell, a porcine cell, a canine cell and a feline cell. 
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137. The method of claim 131, wherein said organism is selected from the 
group consisting of a mammal, a bird, a fish, a worm, an insect, a plant and a yeast. 

138. The method of claim 137, wherein said mammal is selected from the 
group consisting of a human, a non-human primate, a mouse, a rat, a rabbit, a cow, a 
sheep, a pig, a dog and a cat. 

139. The method of claim 1 37, wherein said mammal is a human. 

140. The method of claim 1 37, wherein said fish is a zebrafish. 

141 . The method of claim 137, wherein said worm is a C. elegans. 

142. The method of claim 137, whereui said insect is selected from the group 
consisting of sl Drosophila, a Spodoptera and a Trichoplusa, 

1 43 . The method of claim 137, wherein said plant is a dicotyledenous plant. 

144. The method of claim 137, wherein said plant is a monocotyledenous 

plant. 

145. A cell in vitro that is homozygous for at least one mutation in an 
endogenous gene, wherein said cell comprises an insertional mutation in a furst allele of 
said gene and a non-insertional mutation in a second allele of said gene. 
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146. The cell of claim 145, wherein said cell is a mammalian cell. 

147. The cell of claim 145, wherein said cell is a plant cell. 

148. The cell of claim 145, wherein said cell is a yeast cell. 

149. The cell of claim 145, wherein said cell is an embryonic stem cell. 

150. The cell of claim 146, wherein said mammalian cell is selected from the 
group consisting of a human cell, a non-human primate cell, a mouse cell, a rat cell, a 
rabbit cell, a bovine cell, an ovine cell, a porcine cell, a canine cell and a feline cell. 

151. The cell of claim 1 45 , wherein said cell is an isolated cell. 

152. The cell of claim 145, wherein said insertional mutation is selected from 
the group consisting of an integration mutation, a chromosomal translocations, a deletion 
mutation, a DNA end joining, a double strand break repair, a bridge-break-fusion, 
concatemerization of transfected polynucleotides, retroviral insertion and transposon 
insertion. 

153. The cell of claim 145, wherein said non-insertional mutation is 
accomplished using at least one physicochemical mutagen. 
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154. The cell of claim 153, wherein said physicochemical mutagen is selected 
from the group consisting of UV irradiation, gamma irradiation, x-rays, a restriction 
enzyme, a mutagenic or teratogenic chemical, a DNA repair mutagen, a DNA repair 
inhibitor, an error-prone DNA replication protein, ENU, EMS and ICR191. 

155. The cell of claim 154, wherein said error-prone DNA replication protein 
is an error-prone DNA polymerase. 

156. The cell of claim 153, wherein said physicochemical mutagen is selected 
from the group consisting of ENU and EMS 

157. A non-human organism comprising the cell of claim 145. 

158. The non-human organism of claim 157, wherein said organism is selected 
from the group consisting of a non-human mammal, a bird, a fish, a worm, an insect, a 
plant and a yeast. 

159. The non-human organism of claim 158, wherein said non-human 
mammal is selected from the group consistmg of a non-human primate, a mouse, a rat, a 
rabbit, a cow, a sheep, a pig, a dog and a cat. 

1 60. The non-human organism of claim 158, wherein said fish is a zebrafish. 

161. The non-human organism of claim 158, wherein said worm is a C. 
elegans. 
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162. The non-human organism of claim 158, wherein said insect is selected 
from the group consisting of a Drosophila, a Spodoptera and a Trichoplusa. 

163. The non-hximan organism of claim 158, wherein said plant is a 
dicotyledenous plant. 

164. The non-hxunan organism of claim 158, wherein said plant is a 
monocotyledenous plant. 

165. The non-human organism of claun 157, wherein said organism is a 
transgenic organism. 

166. A cell produced by the method of any of claims 1-17. 

1 67. A library of cells produced by the method of any of claims 1-17. 

1 68. A transgenic organism produced by the cell of any of claims 1-17, 97, 98. 

169. An organism produced by the method of claims 13-16 or 131. 

170. The method of any of claims 1-12 or 14-17 wherein one or more of said 
mutagenesis events is carried out in vitro. 
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